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Preface 


The  1993  International  Workshop  on  Computational  Electronics  took  place  at  Leeds  in 
England.  The  aim  of  the  workshop  is  to  allow  researchers  to  focus  on  the  computational 
aspects  of  solid-state  device  modelling.  This  encompasses  new  device  and  carrier  transport 
models,  numerical  and  computational  techniques,  computational  inq)lementation  of  transport 
theory  (including  quantum),  developments  in  modelling  electronic  strucmres,  new  techniques 
in  computer  visualization  of  models.  This  year  an  underlying  theme  associated  with  compound 
semiconductor  devices  was  adopted,  but  contributions  on  all  aspects  of  solid-state  modelling 
for  all  types  of  material  were  invited. 

The  workshop  consists  of  several  sessions,  with  invited  papers,  contributed  papers  and  poster 
sessions,  allowing  opportunity  for  questions  and  technical  interaction.  Tutorial  sessions  are 
included  at  the  start  of  the  meeting  to  allow  experts  to  address  developments  in  computational 
numerical  techniques. 

This  International  Workshop  on  Computational  Electronics  follows  the  very  successful  series 
of  workshops  held  in  previous  years  in  the  United  States  of  America.  The  earlier  workshops 
were  organised  by  the  National  Center  for  Computational  Electronics  (NCCE)  and  supported 
by  the  US  National  Science  Foundation.  This  year  we  are  grateful  to  the  US  Office  of  Naval  ( 

Research,  the  US  Army  European  Research  Office  and  the  Department  of  Electronic  and 
Electrical  Engineering  at  The  University  of  Leeds  for  their  support. 

We  are  grateful  to  the  members  of  the  Programme  Committee  for  reviewing  the  contributed 

papers  and  to  the  Advisory  Committee  for  their  helpful  comments  and  suggestions.  Over  60  i 

contributed  papers  were  submitted  this  year.  The  very  high  standard  of  these  papers  has  led 

to  a  unique  collection  of  contributions  in  this  area.  We  are  grateful  to  the  invit^  speakers  for 

providing  keynote  addresses. 
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Numerical  Modelling  of  Microwave  Devices 


Eric  A  B  Cole 

Department  of  Applied  kiathentaticai  Studies. 

University  of  Leeds, 

Leeds  LS2  9JT,  UK 

Abstract 

Various  numerical  routines  are  discussed  and  applied  to  the  numerical 
simulation  of  a  planar  sub-micron  gate  length  GaAs  MESFET.  The  model  contains 
energy  transport  and  degenerate  statistics.  A  generalisation  of  the 
Scharf etter-Gummel  method  is  given  which  enables  both  the  electron  current 
density  and  energy  flux  to  be  easily  coded.  Degenerate  statistics  are 
included  for  the  case  in  which  the  relaxation  time  is  proportional  to  E*^. 
Steady  state  results  are  presented  -  these  are  arrived  at  both  by  iterating 
the  transient  solution  and  the  direct  method  based  on  a  modified  Newton 
method. 


I  Introduction 


In  this  paper  we  look  at  the  finite  difference  approach  to  the  modelling  of 
a  two-dimensional  MESFET.  Other  methods  -  the  Finite  Element  method  [1,2], 
the  Boundary  Element  method  and  the  Multigrid  method  [3]  can  be  found 
elsewhere.  To  illustrate  the  general  theory  developed  in  Section  IV,  we  will 
consider  the  simulation  of  a  planar  sub-micron  gate  length  GaAs  MESFET  whose 
cross  secticm  in  the  x-y  plane  is  shown  in  figure  1,  with  the  ends  of  the 
source,  gate  and  drain  at  x»s  ,  s  ,  g^  8  .  and  d  .  The  electron  density 
n,  electron  temperature  T  and  potential  ^  will  all  be  functions  of  x,  y  and 
time  t.  The  equations  are: 

(i)  the  Poisson  equation 


vV 


q 

-(N  -  n) 


(1) 


where  c  is  the  product  of  the  permittivity  of  the  vacuum  and  the  relative 
permittivity  of  the  semiconductor.  This  equation  is  to  be  solved  with  the 
boundary  conditions  <(»»V  on  the  source,  ^►■V  on  the  drain,  on  the 

s  4  S  b 

gate  where  is  the  built-in  potential,  and  dii/dn  ■  0  at  other  parts  of  the 
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Fig.  1.  Cross  section  of  the  MESFET. 


boundary  (In  this  paper,  n  will  represent  the  iinlt  vector  normal  to  the 
surface).  The  electric  field  is 

E  »  ;  (2) 

(»*>  the  current  continuity  equation  (neglecting  recombination) 

-  iy.J  (3) 

at  q  ~ 

where  the  current  density  J  will  have  the  form 


J  =  a  n(y^)  +  p  T  (yn)  ♦  y  n(yT  ) 

C  C  «  C  6 


(4) 


for  certain  coefficients  a  ,  P  and  y  .  The  boundary  conditions  taken  here 

c  c  c 

are  n*2.5N  on  the  source  and  drain,  n=0  on  the  gate,  and  an/an=0  elsewhere; 

d  ^ 

(iii)  the  energy  transport  equation 


aw 

at 


J.E 


(W-W  ) 
_ 0 

T(?) 


(5) 


where  ^  is  the  (position  and  time-dependent)  average  electron  energy,  W^n^, 
t(^)  is  the  energy-dependent  relaxation  time,  and  s  is  the  energy  flux 


s  =  a^W(y^)  ♦  p^T^(yW)  +  y^W(yT^)  (6) 

for  certain  coefficients  a^,  and  y^.  The  forms  of  all  the  coefficients 
will  be  discussed  more  fully  in  section  V  when  particular  statistics  are 
introduced. 

Having  obtained  a  set  of  modelling  equations,  they  should  always  be 
scaled  to  ensure  a  good  numerical  range  for  the  variables  [1,4]. 
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II  Finite  Differences 

We  will  deal  only  with  the  special  case  of  a  non-uniform  rectangular  mesh. 
More  general  techniques  including,  for  example,  mesh  refinement  [1,5]  and  box 
generation  [6]  are  described  elsewhere.  Consider  the  two-dimensional 
rectangular  mesh  shown  in  figure  2.  Mesh  points  will  be  labelled  0,1,2,. .M 
and  0,1,2,. .N  in  the  x  and  y  directions  respectively.  The  general  mesh  point 
will  have  coordinate  (x  ,y ).  The  variable  mesh  spacings  will  be  h  *x,  -x. 

1  J  r  I  J 

(i»0,..M-l)  and  (j*l"N-l).  The  case  of  uniform  mesh  is  given  by 

h^sBh=const.  and  kj=k=const.  The  value  fCx^.y^)  of  any  function  f  will  be 

denoted  shortly  by  f  .  while  its  value  at  the  half-points  (x  +h  /2,y  ), 

l»j  ^  ^  J 

(Xj,y  +k  /2)  and  *')t./2)  will  be  denoted  by  j’  j+1/2 

f  respectively. 

l+l/2,J+l/2 


Fig.  2.  The  mesh. 

Standard  formulae  exist  for  first  and  second  order  derivatives  [7].  The 
important  ones  for  this  simulation  are  listed  below  (resorting  to  the 
one-dimensional  case  temporarily  to  ease  the  notation).  The  half -point 
formula  for  the  first  derivative  of  f  is  given  by 

,  f  -  f 

f  .  - ilill-  (7) 

‘  (hj  ♦  hj  j)/2 

where  the  next  term  -(h^-h^  ^)fj"/4  has  been  neglected.  This  is  useful 
for  evaluating  divergences.  The  second  order  derivative  at  x  can  be  written 


f'  -  f' 

l*X/2  1-1/2 

(hj  +  hj  j)/2 


(hj  +  hj  j)/2 


and  the  neglected  term  here  is  (h  -h  )f  ' "  /3. 

11-11 

We  often  need  an  expression  for  the  normal  derivative  at  a  boundary,  in 
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which  case  it  is  very  often  zero.  Faise  nodes  can  be  introduced  outside  the 
boundary  but  this  increases  storage  requirements.  Aitemativeiy,  this 
derivative  may  be  expressed  at  x^O  in  terms  of  vaiues  at  internal  nodes  by 


f  ' 
0 


h  (h  +2h  )f  -  (h  ♦h  )*f  ♦  h  *f 

11  0  0 _ oil  02 

h  h  (h  -i-h  ) 

0  10  1 


(8) 


with  a  similar  expression  at  x^x^. 

Returning  to  the  case  of  functions  of  two  variables,  the  Poisson  equation 

2 

can  be  discretised  in  the  form 


-it  ^  -it  it  -It  t  -it 

i*i.J  M.J  i.J  M-i.J  i.J*i  ’^i.J  i.J  ’^i.J-i 


(hj+hj  ^)/2 


(k^.k^_^)/2 


In  the  case  of  uniform  mesh  this  reduces  to  the  standard  5-point  formula 
‘‘Vj  *  *  ".4-.  *  V  ■ 

where  p«k/h.  This  equation  has  an  error  of  order  h*,  while  the  more  accurate 
9-point  formula,  with  error  of  order  h*  is  (81 


III  Solution  of  Simultaneous  Equations 

The  discretisation  of  the  differential  equations  of  the  model  generally  gives 
rise  to  the  problem  of  solving  a  set  of  M+1  analjrtical  equations  of  the  form 

fj(X^.Xj...,Xj^)  -  0  (i-0,l,..,lli).  (12) 

where  the  li+1  values  X^  will  represent  typical  physical  quantities  at  each 
point  i.  If  there  are  physical  quantities  described  at  each  point,  then  the 
top  index  II  is  replaced  by  i^(lf-t-])-l.  The  method  of  solution  will  depend  on 
whether  all  the  equations  are  linear  or  at  least  one  equation  is  nonlinear. 


A.  Linear  Equatims 

When  equations  (12)  are  linear  they  take  the  form 


AX  -  b  (13) 

where  A  is  an  (M-t-DxiM-fl)  matrix  and  b  is  an  (li<»-l)  component  vectcH*,  both 
with  elements  which  may  depend  on  X.  These  can  be  solved  by  direct  methods  or 
by  indirect  methods  using  iteration. 


(a)  Direct  Methods.  Very  often  the  matrix  is  tridiagonal  and  a  simple 
solution  is  available.  Suppose  that  the  equations  have  the  form 


aX 

1  1-1 


^X 
•^1  1 


rx 

1  !♦! 


(i-0.1....M) 


and  the  terms  involving  and  do  not  appear.  The  standard  method  of 
eliminating  down  from  the  first  equation  and  solving  the  resulting  final  two 
equations  for  X^  ^  and  X^  gives  the  solution 


X,  .  0;)-(v  -r,x,.,)  ( 


(14) 


where  the  P'  and  d'  are  generated  by 


P  , 

0  0 


B'  =  8  -  a  (B'  )'*y 

^1  *^1  1^1-1  'i-i 


a' 


a  -  a  (8'  )‘‘a' 

1  1  1-1  1-1 


(i*l...,M-l). 


Efficient  coding  of  this  routine  is  available  (9,10).  We  can  usually  get 
away  without  pivoting  in  this  case  because  the  simulation  will  probably  not 
cause  problems  with  the  sizes  of  the  elements  of  the  tridiagonal  system.  In 
one-dimensional  simulations  which  involve  v  physical  variables  at  each  node 
(Tea"  example,  iw3  when  the  variables  are  n,  p  and  tft),  the  equations  at  each 
node  can  be  groiqied  so  that  equations  (14)  still  hold.  In  that  case,  however, 
the  X^,  a^  and  a'  are  v-conqxHient  vectors  and  the  quantities  8j>  7^  and 
8'  are  vxv  matrices.  The  multiplications  in  equations  (14)  are  in  the  correct 
order  for  matrix  multiplication  111). 

When  A  is  not  simply  tridiagonal,  LU  decomposition  with  partial  pivoting 
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m  m  _  9  9  9  9  9 


and  Kaling  to  control  the  growth  of  rounding  errors  may  be  used  [10.12.131- 
Rounding  errors  in  direct  methods  can  oftm  be  eliminated  by  solving  the 
equation  Ae«b-AX  for  the  correcticm  e  to  the  conmuted  solution  X  . 


(b)  Relaxation  Methods.  Consider  the  iterative  {»*ocess 

♦  c  (15) 

where  the  iteration  matrix  B  is  an  (M4l)x(M-*-l)  matrix  with  constant 
coefficients.  If  the  solution  converges  to  X  then  equation  (15)  is  equivalent 
to  the  equation  (I-B)X«c.  Convergence  may  be  slow  if  the  spectral  radius  of 
B  is  close  to  unity,  so  instead  take  a  modified  iteration  process 


rJc-(I-B)X*j 


and  adjust  the  value  of  the  parameter  w  so  that  the  spectral  radius  of  B'  is 
as  small  as  possible.  This  is  the  basis  of  the  relaxation  method. 

In  our  original  problem  given  by  equation  (13).  we  may  write  A=Lp+D+U^ 
where  D  is  diagonal  and  and  are  lower  and  upper  triangular  with  zeros 
on  the  diagonals.  This  decomposition  suggests  the  Jacobi  iteration 

DX^**  =  b  -  (L +U  IX* 

~  ~  0  0  ~ 

with  iteration  matrix  B=»-D*^(L  +U  ).  and  the  Gauss-Seidel  iteration 

0  0 

L  +D)X‘*‘  -  -U  X^  +  b 
0  ~  ~ 

with  iteration  matrix  B»-(L  +D)'*U  .  It  can  be  shown  [9,14.15]  that  the 

0  0 

scheme  (16)  is  convergent  only  for  (Kw<2.  If  (Kwa  then  we  have  Successive 
Under-  Relaxation  (SUR)  while  if  l<w<2  we  have  Successive  Over-Relaxation 
(SOR).  Further,  the  Jacobi  spectral  radius  for  equation  (10)  is 


2 

p  cos 


♦  cos 


1  +  p‘ 


with  optimal  choice 


1  ♦  (1-p/)^* 


This  is  not  the  exact  value  for  the  equation  (11)  but  it  very  often  works 
well  enmigh-  See  also  [16].  The  value  of  w  may  also  be  changed  as  the 
it«rati<»i  progresses  [17],  for  example,  Chebyshev  Acceleration. 
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B.  Noo-Un— r  Equati<Mis 

The  ctaadard  method  h«^  is  the  Newton-Raphson  method.  If  X  and  X«X  -«-6X  are 

~c  ~  ~c  ~ 

the  computed  and  true  solution,  expansion  of  (12)  to  first  order  gives 


J«X  »  -f(X  ) 


where 


ij  ax 


is  the  J.ux>bian.  This  is  a  set  of  linear  equations  which  can  be  solved  (among 
other  methods)  using  the  iteration  scheme 


/ax^  -  -f(X^)  . 


X^**  *  X^  +  adX^ 


where  a  is  taken  such  that  (Koc^l  to  avoid  overshoot.  It  is  usually  necessary 
to  take  a  small  in  the  early  stages  of  the  iteration  and  then  to  steadily 
increase  it  as  the  iteration  progresses. 

An  alternative  Newton  method  has  been  found  to  make  iterations  go  more 
smoothly.  Instead  of  solving  for  we  solve  directly  for  : 

=  JCX^  +  «a^)  «  -  af*'  .  (17) 

The  advantages  here  are  that  we  can  usually  take  larger  values  of  a  and  that 
fewer  initial  guesses  need  to  be  made.  When  (1^0,.. M)  then  (17)  becomes 


with  corresponding  iteration  process 

X*'**  »  (l-w)x'‘*‘  +  wJ  ^[(-af  +  JX^ij  -  EJ  xH  (18) 

where  f*f(^)  and  ^JIX*').  If  X  represents  a  natural  grouping  of  v  quantities 
at  each  point  (that  is,  X^  is  a  v-compone  it  vector)  then  equation  (18)  still 
holds  with  being  a  nonsingular  matrix,  and  w  as  a  diagonal  matrix 
with  separate  relaxation  factors  down  the  diagonal. 

Evaluation  of  the  Jacobian  can  be  the  most  time-consuming  part  of  the 
operation.  It  is  possible  to  differentiate  the  functions  f^  numerically  using 

a f  j  f  (X^, . . , X^+AX^, . . .  Xj^)-f  (X^, . .  X^, . .  X^) 


vdiere  the  AX^  are  suitable  increments.  If  the  AX^  are  too  small  then  roundoff 
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ernv^  can  swamp  the  calculation,  while  convergence  will  be  linear  if  they 
are  too  large  (18).  Methods  could  be  used  in  which,  say,  standard  functions 
are  differentiated  explicitly  while,  for  example,  mobility  curves  fed  in  from 
other  simulations  could  be  differentiated  numerically.  Also  using  Broyden’s 
rank  one  correction  it  is  not  necessary  to  evaluate  J  at  each  iteration  [19]. 

rv  Discretising  the  Current  Continuity  and  Energy  Equations 


An  explicit  time  discretisation  of  the  current  continuity  equation  will  only 
work  if  the  timestep  At  is  excessively  small  (20),  while  a  fully  implicit 
Crank-Nicholson  scheme  (12)  is  difficult  to  solve.  The  linearised 
semi-implicit  scheme  (21)  is  more  satisfactory  unless  At  is  taken  too  large. 
This  limitation  can  be  avoided  if  we  use,  in  the  case  of  constant  T  ,  the 

e 

Scharfetter-Gummel  method  in  which  we  take  an  exponential  variation  in  the 
carrier  deviations  between  nodes  (22,231. 

The  following  is  a  generalisation  of  the  Scharfetter-Gummel  method  when 
T^  is  not  constant  throughout  the  device.  Equations  (4)  and  (6)  can  be 
written  in  general  form 

y  =  a(v<^r)0  +  pTjve)  +  romj  (19) 

where  (y«J,  wn)  for  the  current  density  and  (y«s,  0*W;  for  the  energy  flux. 
For  numerical  purposes  only  we  now  make  the  assumptions  that,  in  the  interval 
(i,j)-*(i+l,j),  the  quantities  7</i,  VT^,  p,  a/p  and  y/p  are  constants.  The  x- 
component  of  equation  (19)  becomes 


ax  Ip  T  PjT  p  T  8x1 

^  •  e  J 


1+1/2,J 


which  has  integrating  factor 


-[f(i  {M-  *  ? 
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1*1/2, 
F 


-[*'y/3(  ^1*1. j’ ^i.j]®»*».j 

~^7/fi  1  .j“^l.  J  ^l.J  ’^1*1  .j]  ®i.j] 


(21) 


with  a  similar  expression  for  (V  ).  ..  -  Here, 

y  l,J*l/2 

by 

C(x.,.z)  . 

where  p(y,z)  ■  ( z-y )  and  B(t)  ■  t 

TnTzTyT  -  1 

One  important  property  of  this  C-function,  namely 


C  is  the  function  defined 

(y»z)  (22) 

is  the  Bernoulli  function. 


WC  (x.T  ,T)  »  T  B(x/T^) 
•  0  0  0 

0 


is  usef ul  for  applying  to  models  in  which  the  electron  temperature  is  a 
constant  T^.  and  the  original  Scharfetter-Gummel  expression  is  then  obtained. 

It  is  necessary  to  avoid  overflows  and  underflows  when  evaluating  the 
C-functions.  The  usual  method  (1]  is  to  make  a  piecewise  machine-dependent 
approximation  for  B(t).  The  derivative  of  the  Bernoulli  function  can  be 
approximated  by 


B' (t)  - 


f  (t-2)/4 


V 


for  t  s  t  s  t 

2  3 

Otherwise. 


V  Evaluation  of  the  Coefficients 

The  Fermi  integral  F^(t))  is  defined  as 

m  m 

F^(i})  ■  r(r+l)'’Jy'^{l  e^"*)'^dy  where  r(r+l)  ■  Jy*^e~^dy 

0  0 

This  function  has  been  extensively  approximated  and  tabulated  [24].  Write  k 

8 

as  Boltzmann’s  ccmstant,  h^  as  Planck’s  c<»stant,  p  as  the  mobility,  m^  as 

the  effective  electron  mass,  ipIE  -E  )/(k  T  ),  and  take  the  energy  dependence 

cf  the  relaxation  time  as  t«(E-E  f.  The  first  few  moments  of  the  Boltzmann 

c 

transport  equation  then  give  [25,26] 


19 


where 


n  -  AT  Ji») 
•  1/2 


A  ■  2(2«k  m  )^/h  ^  , 

B  •  P 


W  -  ^  T  DF(ii)/rii,)  . 

2  B  •  3/2  '  1/2 

J  -  Mn(-qV^  +  k  T  Vij)  ♦  I  (VT  )/T  , 

~  g,  2«e 


where 


qs  ■  -I  (-qV^  +  k  T  Vt»)  -  I  (VT  )/T 

~  2  B*  3e« 


>3  ■ 


r+3/2 


After  extensive  manipulation  and  using  the  constant  effective  mass 

approximation,  it  is  found  that  J  and  s  have  the  forms  (4)  and  (6)  where 


a  s*  -qp 
c 


^F^(F  .^F^))/q 

and  Fj»Fj(7})  throughout. 

At  certain  stages  of  the  numerical  implementation  it  is  necessary  to  find 
1)  and  at  each  point  given  the  values  of  n  and  W.  From  the  above,  it  may  be 


shown  that 


-  54h,^8inn  )’^  n®/W^ 

1/2  '  3/2  P  •  ' 


and  this  may  be  inverted  to  give  t|  and  hence  T  from 


Conversely,  to  find  ri  and  W  given  n  and  T^,  i)  may  be  calculated  at  each  point 
bj  inverting  the  equation 
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F  ^n)  -  n/{KT^)  (24) 

and  calculating  W  frtHn 

*  •  • 

Both  these  inversions  must  be  done  at  each  grid  point.  Once  the  functions 
and  F^  have  been  programmed,  inversion  is  straightforward  since  the 
left  hand  sides  of  equations  (23)  and  (24)  are  strictiy  increasing  functions 
of  n. 


VI  Implementation  for  the  MESFET 

We  now  apply  the  preceeding  results  to  the  simulation  of  the  MESFET  model 
outlined  in  section  I.  Referring  to  figure  1,  the  dimensions  taken  were 
s^«O.OMm,  s^»0.2pm,  g^»K).6pm,  g^^Llpm,  d^*1.6jim  and  d^=1.8pm.  The  total 
thickness  including  substrate  was  0.4Sfim.  An  abrupt  junction  at  y»0.3S)im  was 
taken  with 

10^  m  ^  for  0|im  s  y  s  0.35pm 
10*^  m'^  0.  35pm  <  y  a  0.  45pm. 


Monte  Carlo  simulations  and  experimental  data  on  the  steady  state  transport 
characteristics  provide  f  %nres  of  C  and  x  in  terms  of  the  static  electric 
field  E^  which  is  used  as  an  intermediate  parameter.  This  enables  x  to  be 
found  in  terms  of  The  mobility  is  given  by 


300p^ 


1  +  8.5xlO*E^ 

p  (l-5.3xlO’^T  )E  * 
0  0  0 

0 


where 


0.8 

1  +  (N 

d 


and  E»4xlO®Vm  All  boundary  conditions  8/dn^  were  implemented  using 
equation  (8)  and  its  equivalents  with  all  derivatives  zero.  The 
quantities  J  and  s  were  coded  at  the  half  points  using  appropriate  values  in 
exfnassion  (21).  Three  solutions  were  performed;  the  transient  solution,  the 
steady  state  reached  by  iterating  the  transient  solution,  and  the  direct 
solution  of  the  steady  state. 

(a)  The  transient  solution.  A  timestep  At»10~**s  was  used.  The  Poisson 
equation  (11)  was  used  to  extract  an  expression  A  for  d*  in  terms  of  the 

UJ  IsJ 

other  quantities,  and  was  solved  by  iterating  the  equation 
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(1-w  M  ♦  w  A 

p*i  ^i.j  p«i  I.J 


The  current  continuity  and  energy  equations  (3)  and  (5)  vrere  simiiarly  soived 
by  coding  the  C  functions  with  the  appropriate  quantities  It  was 

found  that  the  separate  reiaxation  factors  w  «1.4,  w  *0.8  and  w_»0.5  gave 

p«l  n  T 

the  fastest  iteration.  At  each  timestep,  the  three  equations  were  iterated 

separately  inside  an  overall  iteration.  A  mixture  of  convergence  tests  was 

used.  In  the  case  of  the  ^  iteration,  we  required  convergence  at  each  point, 

that  is,  we  required  that  1 0  -^1  |  had  to  be  less  than  some  prescribed 

value  at  every  point,  where  01  was  the  value  at  the  previous  iteration.  The 

same  test  was  applied  to  the  electron  temperature  T^.  In  the  case  of  the  n^  ^ 

we  applied  a  weaker  test  by  requiring  that  only  the  average  relative 

differraice  V  |  n  -nl  |  /n  be  smaller  than  some  prescribed  value.  It  was 
•->  i,J  i,J 

found  that  this  average  condition  gave  a  smoother  time  plot  of  the  total 
current 


0.45 

■‘tot  “  J  (•‘x  *  c^E^'^^tldy  . 
0 


(b)  Iteration  to  the  steady  state.  The  method  of  (a)  was  used  but  with  no 
iteration  of  the  equations  at  each  timestep.  A  total  of  2000  timesteps  was 
used,  giving  a  device  time  of  20ps.  Results  for  0.  n  and  T^  are  shown  in 
figure  3  (viewed  from  corner  A  in  figure  1)  for  the  case  r*-l. 

(c)  Direct  method.  A  natural  grouping  X  *(0  ,  n  ,  T  exists  at 
each  gridpoint  (i.j).  Put  all  d/dt  terms  zero  in  equations  (1),  (3)  and  (5) 
and  write  the  equations  generally  as  0^*0,  G^»0  and  G^=0.  The  Newton  method 
applied  at  each  point  gives 


Aax  ^  BAX  +  cax  oax  *  Eax 

~I.J  ~I,M 

where  A  sdG  /S0  etc  are  3x3  matrices.  Writing 

ml  m  I-l.J 

modified  Newton  method  (18)  becomes 


“G 

y  aX  +aaX  ,  the 
*i.J  ~Ui 


k+l 


(I-w)y''*^  ♦  wC‘M-aG  +  AX^  +  BX*  +  CX*‘  ♦  DX^  +EX^ 

*i.j  {  ~  ~i.J  ~i.j+i 

*  k  +  l  _k  +  l  1-  I***  1 

~  ^^1  -i.f  ♦i,j~ 


where  w  is  the  diagonal  matrix  with  nonzero  elements  w  ,  w  and  w  .  This 

p«i  n  T 

routine  is  fragile  in  the  early  stages,  and  for  the  first  few  Newton 

iterations  it  is  necessary  to  take  a  very  small  (typically  0.01),  n  to  be 

kept  positive  and  T  to  be  not  less  than  T  . 

•IJ  0 
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Abstract 

The  classical  and  quantum  hydrodynamic  equations  are  presented  in  a  unified  formula¬ 
tion  and  the  3D  transport  equations  are  mathematically  classified.  The  ID  steady-state 
classical  and  quantum  equations  are  discretized  in  conservation  form  using  an  upwind  ( 

method.  A  classical  hydrodynamic  simulation  of  a  steady-state  electron  shock  wave  in  a 
one  micron  Si  semiconductor  device  at  77  K  is  presented  and  compared  with  a  DAMO¬ 
CLES  simulation  of  the  Boltzmann  equation.  Quantum  hydrodynamic  simulations  of  a 
resonant  tunneling  diode  are  presented  which  show  charge  buildup  in  the  quantum  well 
and  negative  differential  resistance  in  the  current-voltage  curve.  ^ 


I.  Introduction 


Electron  propagation  in  a  semiconductor  crystal  is  well  modeled  down  to  submicron  scales 
by  the  classical  hydrodynamic  model.  The  classical  hydrodynamic  equations  can  be  ex¬ 
tended  to  include  quantum  effects  by  incorporating  the  first  quantum  corrections.  These 
0{h^)  terms  allow  particle  tunneling  through  potfsntial  barriers  and  particle  buildup  in 
potential  wells. 

The  aim  of  this  paper  is  to  give  a  unified  presentation  of  the  classical  and  quantum 
hydrodynamic  conservation  laws  and  of  their  mathematical  classification  and  numerical 
discretization.  I  will  also  present  a  cleissical  hydrodynamic  (CHD)  simulation  of  an  elec¬ 
tron  shock  wave  in  a  one  micron  Si  n*  —n—n*  device  at  77  K  and  quantum  hydrodynamic 
(QHD)  simulations  of  resonant  tunneling  in  an  GaAs/AlxGai-cAs  diode  at  77  K. 

The  classical  and  quantum  hydrodynamic  conservation  laws  have  the  sztme  form: 

dt  axi' 


dt  ^  dXi 


dW 

dt 


k)  =  0 

(1) 

dV 

Pi 

(2) 

~  ”ax^ 

dV 

{W-^nTo) 

(3) 

-nui-T - 

ax, 

Tw 
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where  n  is  the  electron  density,  u  is  the  velocity,  p  is  the  momentum  density,  Pij  is 
the  stress  tensor,  V  =  — is  the  potential  energy,  is  the  electric  potential,  e  >  0  is 
the  electronic  charge,  W  is  the  energy  density,  q  is  the  heat  flux,  and  To  is  the  lattice 
temperature  in  energy  units  (k^  is  set  equal  to  1).  Indices  i,  j  equal  1,  2,  3,  and  repeated 
indices  are  summed  over.  E^.  (1)  expresses  conservation  of  electron  number,  E^.  (2) 
expresses  conservation  of  momentum,  and  Exj.  (3)  expresses  conservation  of  energy.  The 
collision  terms  in  Eqs.  (2)  and  (3)  are  modeled  here  by  the  standard  relaxation  time 
approximation,  with  momentum  and  energy  relaxation  times  Tp  and 

The  classical  and  quantum  hydrodynamic  equations  can  be  derived  from  a  moment 
expansion  of  the  Wigner-Boltzmann  equation.  The  classical  equations  are  obtained  by 
setting  A  =  0.  To  close  the  moment  expzmsion  at  three  moments,  we  have  to  define  e.g. 
p,  Pij,  W,  auid  q  in  terms  of  n,  u,  and  T,  where  T  is  the  electron  temperature. 

In  the  simplest  approximation,  the  heat  flux  is  specified  by  the  Fourier  law  q  =  — «VT. 
For  the  “momentum-shifted”  thermal  equilibrium  Wigner  distribution  function,  the 

momentum  density  p  =  mnu,  where  m  is  the  effective  electron  mass,  the  stress  tensor  is 
given  by 

=  +  +  (4) 

and  the  energy  density  by 

W  =  ^nT  +  imnu^  -  log(n)  -I-  .  (5) 

I  derived  the  full  three-dimensional  quantum  hydrodynamic  model  for  the  first  time  by  a 
moment  expansion  of  the  Wigner-Boltzmann  equation  in  Ref.  [1].  The  quantum  correction 
to  the  energy  density  was  first  derived  by  Wigner  (2].  The  quantum  correction  to  the  stress 
tensor  was  proposed  by  Ancona  and  Tiersten  (3]  on  general  thermodynamical  grounds  and 
derived  by  Ancona  and  lafrate  [4]  in  the  Wigner  formalism.  In  the  one-dimensional  case, 
the  3D  QHD  equations  reduce  to  the  QHD  model  of  Grubin  and  Kreskovsky  [5]. 

The  actual  expansion  paurameter  in  the  QHD  equations  is  h^/SmTP,  where  /  is  a 
characteristic  length  scale  of  the  problem  [4].  For  the  resonant  tunneling  diode  simulations 
in  section  V  with  T  »  To  =  77  K  and  I  =  100  A,  the  expauision  parameter  fa  0.23. 

The  transport  equations  (l)-(3)  are  coupled  to  Poisson’s  equation  for  the  electric 
potential  energy 

V-  (eVV^)  =  e^No  -Na-ti)  (6) 

where  e  is  the  dielectric  constant.  No  is  the  density  of  donors,  and  Na  is  the  density  of 
acceptors. 


n.  Classification  of  the  Hydrodynamic  Equations 

To  classify  the  hydrodynamic  equations,  rewrite  the  hydrodynamic  equations  (l)-(3)  and 
(6)  (with  Tp,  Ty,  —*  oo)  in  terms  of  n,  u,  and  T: 


dXi 


(nui)  =  0 


(7) 
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duj  dui  Id.  h}  d  (  ^  1  /  \\ 

IT  ~  12m>n  dxi  y'dxidxj  °*^"V 

^4.  .^4.  _  i. A  f  ^ 

dt  **‘di,  3  di,  3ndxi  \  dxj/ 


I  dv  „ 

+  -— =  0 


m  dxj 
(nV’ui)  =  0 


(8) 


36mn  dx/ 

-  V(cVV)  +  e\ND  -NA-n)^0 

Then  linearize  the  PDEs  (7)-(10)  with  respect  to  a  Fourier  mode  perturbation,  and 
freeze  coefficients.  Set 


(9) 

(10) 


n 

■  fi  ■ 

Sn 

T 

= 

TTi 

T 

Sui 

ST 

.  V  . 

,V  . 

.  sv . 

(11) 


where  [?r,  li?,  T,  is  a  solution  of  the  hydrodynamic  equations. 

Next  write  the  linearized  hydrodynamic  equations  in  terms  of  the  symbol  S  of  the 
linearized  PDE  system  (7)-(10)  as 


-  diag{<T,  a,  <r,  <t,  <t, 


0}  [6n,  Sui,  ST,  6V]  +  S[6n,  Sm,  ST,  =  0 


(12) 


As  |k|  — »  oo,  5  —  diag{<T,  trSij,  a,  0}  has  the  form 


ik  •  u  —  a 

ikiu 

0 

0 

Ji 

(zk  •  u  -  a)Sij 

zk  •  u  -f  —  a 

0 

0 

0 

0 

(13) 


where  t  labels  columns  in  the  velocity  perturbation  Sui  and  j  labels  rows  in  the  velocity 
equation  (8).  I  have  dropped  the  bar  over  the  solution  [n,  Uj,  T,  V],  and  have  separately 
kept  the  leading  terms  in  k  for  the  limits  h  —*  0  and  k  — »  0. 

The  mathematical  type  of  the  PDE  system  is  determined  by  the  asymptotic  eigenvalues 
ff  of  the  symbol  as  |k|  — »  oo.  We  need  only  consider  the  upper  5x5  block  Sb  of  the 
symbol,  since  the  coupling  of  the  transport  equations  (7)-(9)  to  Poisson’s  equation  (10) 
only  introduces  the  elliptic  Poisson  mode,  and  does  not  affect  the  modes  of  the  transport 
equations. 

There  are  three  physically  interesting  cases  to  consider: 


(l)ft  =  0,  K  =  0  (electrogasdynamics).  The  eigenvalues  of  the  symbol  Ss  amd  correspond¬ 
ing  modes  are 

J  i(k  •  u  ±  Arc)  hyperbolic 

\  zk  •  u  hyperbolic,  multiplicity  3  ^ 

where  c  =  ^STJZm.  There  are  five  nonlinear  waves  in  classical  electrogasdynamics  cor¬ 
responding  to  the  five  hyperbolic  modes:  two  shock  waves  and  three  contact  waves.  Two 
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contact  waves  can  be  labeled  by  a  jump  in  the  tangential  velocity  U(  across  the  wave,  and 
one  contact  wave  by  a  jump  in  the  temperature  T. 

(2)  =  0,  K  >  0  (hydrodynamic  model  or  electrogasdynamics  with  heat  conduction).  The 

eigenvalues  of  the  symbol  Ss  and  corresponding  modes  are 

{t(k  ■  u  ±  kc)  hyperbolic 

tk  u  hyperbolic,  multiplicity  2  (15) 

^k^Kjn  +  ik  •  u  parabolic 

where  c  =  ^T/m.  With  heat  conduction,  there  are  four  nonlinear  waves  [6]  in  the  CHD 
model  corresponding  to  the  four  hyperbolic  modes;  two  shock  waves  and  two  contact 
waves.  The  two  contact  waves  can  be  labeled  by  a  jump  in  the  tangential  velocity  Ut 
across  the  wave.  The  contact  wave  corresponding  to  a  discontinuity  in  T  has  disappeared 
due  to  the  parabolic  heat  conduction  term  V-  (kVT)  in  Eq.  (3). 

(3)  5  ^  0,  K  >  0  (quantum  hydrodynamic  model).  The  eigenvalues  of  the  symbol  Ss  and 
corresponding  modes  are 

Schrodinger 

'  tk  •  u  hyperbolic,  multiplicity  2  (16) 

^k'^K/n  parabolic 

There  are  two  contact  discontinuities  (in  Ut)  in  the  QHD  model  corresponding  to  the  two 
hyperbolic  modes.  Note  that  two  of  the  hyperbolic  modes  (which  allow  shock  discontinu¬ 
ities  to  form)  in  the  classical  hydrodynamic  model  have  become  Schrodinger  modes  when 
the  quantum  corrections  are  included. 

Well-posed  boundwy  conditions  for  the  2D  (and  by  extension  3D)  classical  hydrody¬ 
namic  model  are  formulated  in  Ref.  [7],  assuming  subsonic  flow  at  the  inflow  and  outflow 
boimdaries.  Here  I  will  simply  note  that  in  one  dimension,  the  CHD  model  (with  heat 
conduction)  has  two  hyperbolic  modes,  one  parabolic  mode,  and  one  elliptic  mode,  and 
the  QHD  model  (with  heat  conduction)  has  two  Schrodinger  modes,  one  parabolic  mode, 
and  one  elliptic  mode.  Thus  six  boundary  conditions  are  necessary  for  the  CHD  model  and 
eight  boundary  conditions  for  the  QHD  model.  Well-posed  boundary  conditions  for  the 
ID  CHD  equations  are  n  =  Np,  and  T  =  To  (or  dT/dx  =  0)  at  Xmin  and  x^asi  with  a  bias 
AV  across  the  device:  V(imi„)  =  riog(n/ni)  and  V{xmaz)  =  nog(n/ni)  -|-  eAV,  where 
n,'  is  the  intrinsic  electron  concentration.  For  the  ID  QHD  equations,  I  add  dnjdx  =  0 
at  x^  and  x^. 

m.  The  Second  Upwind  Method 

I  will  discretize  the  ID  steady-state  hydrodynamic  equations  using  an  upwind  method 
adapted  from  computational  fluid  dynamics.  Since  the  upwind  method  requires  velocity 
values  u_^,  U|,  •  *  *  ,  ^  midpoints  of  the  elements  —  •••  ,  N) 


amnecting  grid  points  t  —  1  ;\nd  t,  I  will  use  a  staggered  grid  for  u.  (1  impose  a  computa¬ 
tional  Neumann  boimdary  condition  dufdx  s  0  at  t  s  0  and  t  s  N  to  determine  u_^  and 
ttjv-f  ^‘)  variables  n,  T,  and  V  are  defined  at  the  grid  points  t  =  0,  1,  ■  •  •  ,  iV  —  1,  N. 
Tbe  boundary  conditions  specify  n,  T,  and  V  (and  dn/dx  ior  the  QHD  model)  at  t  =  0 
and  i=s  N. 

In  one  dimension,  the  steady-state  CHD  and  QHD  models  consist  of  the  three  nonlinear 
omservation  laws  for  electron  number,  momentum,  and  energy,  plus  Poisson’s  equation: 

’  /n  1  f  1  r  ®  1  [  ® 

=0  (17) 

jT  dx  ugr  hr  sr 

fv  0  hv  iv 


where  (set  li  =  0  for  the  CHD  model) 


5n  =  n 


=  mnu 


W  =  |nr  +  imnu’  -  log(„)  +  n V 

*■  =  i 

.  <Pv 

hv  =  €-r-r 


/3  _  1  2  <P  .  ,  .  3 

>T  =  ( jnr  +  jmnu  -  log(n)  -  -nTc  j  /r.  (25) 

SV  =  e*(7Vo  _  jVx  -  n)  .  (26) 

Equations  /n  =  0,  /r  =  0,  and  fv  =  0  are  enforced  at  the  interior  grid  points 
t  =  1,  •  •  •  ,  N  —  1,  while  equation  /«  =  0  is  enforced  at  the  midpoints  of  the  elements 
Ij,  i  =  1,  •  •  •  N, 

In  the  secc^nd  upwind  method,  the  advection  terms  d{ug)/dx  in  Elq.  (17)  are  discretized 
using  second  up'^^'ind  differences’ 

^(up).  ss  {Ui^gji  -  u,_jpl)/Ax  (27) 

’The  seond  upwind  method  is  a  ccmservative  extension  of  the  original  first-order  upwind  method. 


29 


where 


(28) 


/  9i  («<+J  >0)  _  /  9i~i  (“<-J  >  0) 

1  9i*i  («i+^  <  0)  ’  ~  I  ft  (“i-J  <  0) 

and  seomd-order  cratral  differences  are  used  for  h^,  hj,  hv,  and  st- 
I  use  Newton’s  method  to  linearize  the  discretized  version  of  Eq.  (17): 


Sn 

[/-] 
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Sn 
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fu 
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+ 1 
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ST 
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It 
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ST 
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[fv  J 

.  V  . 

.  V  . 

.  sv . 

where  J  is  the  Jacobian  and  t  is  a  damping  factor  between  0  and  1,  chosen  to  insure  that 
the  norm  of  the  residuiJ  /  decreases  monotonically. 


rV.  CHD  Simulation  of  the  Electron  Shock  Wave 

The  nonlinear  hyperbolic  modes  of  the  CHD  model  allow  shock  waves  to  develop.  A 
steady-state  electron  shock  wave  in  a  semiconductor  device  was  first  simulated  in  Ref.  [8]. 
The  shock  simulation  has  been  confirmed  by  a  Monte  Carlo  simulation  of  Laux  of  the 
Boltzmann  equation  using  the  DAMOCLES  [9]  program.  The  semiconductor  device  is  an 
n'*'  —  n  —  n'*'  Si  diode  at  77  K  with  0.1  micron  source  and  drain,  with  n***  doping  density 
Nd  =  10“  cm"*,  and  a  1.0  micron  channel,  with  n  doping  density  Nd  =  10^®  cm"®. 

For  the  momentum  and  energy  relaxation  times  in  the  hydrodynamic  model,  I  use 
modified  Baccarani-Wordemann  models: 


where  the  low  energy  momentum  relaxation  time  is  set  equal  to  1.67  picoseconds  from 
the  DAMOCLES  data  for  0.00995  eV  electrons  in  homogeneous  Si  and  v,  —  v,{To)  is  the 
saturation  velocity.  For  Si  at  77  K,  m  =  0.24  m,  and  v,  =  1.2  x  10^  cm/s. 

The  hydrodynamic  and  Boltzmann  simulations  agree  remarkedly  well  [10]  when  the 
amount  of  heat  omduction  in  the  hydrodynamic  model  is  adjusted.  The  best  fit  for  the 
thermal  conductivity  k  in  the  Wiedemann-Franz  law  for  heat  conduction 

q  =  —kVT,  k  =  KoT^nTo/m,  (32) 


is  given  by  ico  =  0.05. 

Figs.  1  and  2  compare  the  hydrodynamic  and  DAMOCLES  simulations  of  the  77  K 
electron  shock  wave  in  Si  at  a  bias  AV  =  1  volt.  DAMOCLEIS  calculates  a  current  of 
4500  amps/cm®,  very  close  to  the  hydrodynamic  value  of  4460  amps/cm®.  The  velocity 
plot  Fig.  1  most  clearly  shows  the  shock  profile,  which  is  spread  out  slightly  due  to  the 
parabolic  heat  conduction  term  in  the  CHD  model.  The  flow  is  supersonic  at  the  velocity 
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peak  just  inside  the  chwnel,  and  subsonic  at  the  end  of  the  ‘^velocity  overshoot”  wave 
where  the  velocity  makes  a  “bend”  to  a  constant  value  in  the  channel. 

The  DAMOCLES  velocity  exhibits  a  Mach  2.1  shock  profile  based  on  both  internal 
evidence^  and  comparison  with  the  hydrodynamic  simulation. 

The  type  of  velocity  overshoot  illustrated  in  Fig.  1  is  always  associated  with  a  shock 
wave  for  the  —  n  —  n*  diode.  As  the  electrons  enter  the  channel,  the  electron  velocity 
increases  rapidly  to  a  peak  value  greater  than  the  saturation  velocity  v«.  At  the  same  time, 
the  electron  temperature  falls  slightly  as  the  electrons  overcome  the  small  potential  barrier 
at  the  source/channel  junction.  Thus  the  electron  M2tch  number  M  near  the  velocity  peak 
is  greater  than  t;,/c  >  v,/co,  where  c  =  ^T/m  is  the  soundspeed  at  temperature  T  and 
is  the  soundspeed  at  the  ambient  temperature  Tq.  For  Si  at  77  K,  i;«  =  1.2  x  10^ 
cm/s,  Cto  =  7.0  X  10®  cm/s,  and  M  >  1.7.  On  the  other  hiind,  the  electron  flow  near  the 
channel/drain  is  subsonic  since  v  «  u,,  while  T  »  To,  making  c  w,. 
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Fig\ire  1:  Hydrodynamic  and  Monte  Carlo  electron  velocity  in  10^  cm/s  for  AV  =  1  volt. 
The  jagged  curve  is  the  DAMOCLES  result.  The  chiuinel  is  between  x  =  0  aind  x  =  1 
micron. 

The  transition  from  supersonic  flow  to  subsonic  flow  in  general  necessitates  a  shock 
wave  in  gas  dyn2unics® — that  is,  a  wave  over  which  density,  velocity,  and  (if  heat  conduc¬ 
tion  equals  zero)  temperature  change  very  rapidly.  The  n*  drain  in  the  source-channel- 

®The  electron  temperature  T  77  K  at  the  shock  wave.  Using  the  effective  electron  mass  ^proxi- 
rnatiem,  the  electron  Mach  number  M  =  v/c  =  vfyJTJm  »  2.1. 

^See  Courant  and  Friedrichs  [11],  pp.  380-387. 
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Figure  2:  Hydrodynamic  and  Monte  Carlo  electron  average  energy  in  eV  for  A  V  =  1  volt. 
The  jagged  curve  is  the  DAMOCLES  result. 

drain  structure  of  the  diode  provides  the  mechanism  that  forces  a  supersonic  flow  in  the 
channel  back  down  to  subsonic  flow. 

The  excellent  agreement  between  the  hydrodynamic  and  DAMOCLES  results  is  re¬ 
markable  in  that  the  hydrodynamic  model  is  orders  of  magnitude  faster  than  Monte  Carlo 
simulation  of  the  Boltzmann  equation.  The  hydrodynamic  model  also  provides  a  mathe¬ 
matical  framework  in  which  to  understand  the  velocity  overshoot  wave  in  the  n'*'  —n  —  n* 
diode. 

y.  QHD  Simulation  of  the  Resonant  Tunneling  Diode 

The  behavior  of  quantum  devices  that  depend  on  particle  tiumeling  through  potential 
barriers  and/or  charge  buildup  in  potential  wells  can  be  efficiently  simulated  using  the 
QHD  model.  Here  I  will  present  simulations  of  a  GaAs  resonant  tunneling  diode  with 
double  AlsGai.xAs  barriers,  with  barrier  height  B  =  0.209  eV.  The  diode  consists  of  an 
n'*'  source  (at  the  left)  and  an  n'*'  drain  (at  the  right)  with  the  doping  density  iVp  =  10** 
cm~*,  and  an  n  diannel  with  No  =  5  x  10**  cm~*  (see  Fig.  3).  The  channel  is  250  A 
long,  the  barriers  are  50  A  wide,  and  the  quantum  well  between  the  barriers  is  50  A  wide. 
To  enhance  resonant  tunneling,  the  device  has  50  A  spacers  between  the  barriers  and  the 
contacts. 

The  barrier  height  B  is  incorporated  into  the  QHD  transport  equations  (l)-(3)  by 
replacing  V  -*  V  B.  (Poisson’s  equation  is  not  changed.) 
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Figure  3:  Doping/lO'*  cm“®. 


Figure  4:  Current  density  in  kiloamps/cm’  vs.  voltage  for  the  resonant  tunneling  diode 
at  77  K.  Ko  =  0.2  (black),  0.4  (dark  gray),  and  0.6  (gray).  The  dots  represent  computed 
solutkm  points.  4 
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F<»'  lower  valley  electrons  in  GaAs  at  77  K,  the  low-energy  momentum  relaxation 
time  TfO  in  Eqs.  (30)  and  (31)  is  set  equal  to  0.9  picoseconds,  the  effective  electron  mass 
m  =  0.063  m«,  and  the  saturation  velocity  (in  Eq.  (31))  v.  «  2  x  10^  cm/s.  The  dielectric 
constant  t  =  12.9  for  GaAs. 

Current-voltage  curves  for  the  resonant  timneling  diode  at  77  K  are  plotted  in  Fig.  4 
for  three  different  values  of  kq  in  the  expression  for  thermal  conductivity  (32).  These  are 
the  first  simulations  [1]  of  the  full  QHD  equations  to  show  NDR  in  the  resonant  tunneling 
diode. 

The  peak  of  the  current-voltage  curve  occurs  as  the  electrons  timneling  through  the 
first  barrier  come  into  resonance  with  the  the  ground  state  of  the  quantum  well.  Note 
the  presence  of  a  “shoulder”  in  the  current-voltage  curve  around  AV  =  0.23  volts.  The 
shoulder  signAls  the  location  of  the  first  virtual  state  of  the  quantum  well.  The  location  of 
the  valley  and  shoulder  can  be  qualitatively  understood  from  the  energy  levels  of  a  square 
well.  For  a  50  A  wide  0.209  eV  high  GaAs  finite  square  well,  there  is  just  one  bound  state 
energy  level  at  0.079  eV.  The  energy  of  the  first  virtual  state  of  the  well  is  0.24  eV. 

The  main  effect  of  larger  values  of  kq  is  to  shift  the  peak  of  the  current- voltage  curve 
to  the  right.  With  lower  values  of  kq,  the  electrons  have  a  higher  average  energy  as  they 
impinge  upon  the  first  barrier,  and  therefore  resonate  with  the  well  at  a  lower  applied 
voltage. 

A  physically  relevant  value  of  kq  is  approximately  0.4  for  this  device.  The  peak  to  valley 
current  ratio  of  1.95  agrees  quantitatively  with  experimental  ratios  for  similar  devices. 

Fig.  5  shows  the  dramatic  charge  enhancement  in  the  quantum  well  typical  of  the 
resonant  tunneling  diode  for  applied  voltages  of  AV  =  0.097  (peak),  0.191  (valley),  and 
0.22  volts  (just  before  the  shoulder)  (with  kq  =  0.4).  The  electron  density  at  the  center 
of  the  quantum  well  increases  as  AV  increases,  and  is  more  than  two  orders  of  magnitude 
larger  than  the  background  doping  density.  Note  the  depletion  of  electrons  around  the 
channel-drain  junction. 

As  illustrated  in  Fig.  6,  the  electrons  spend  the  longest  time  in  the  quantum  well 
for  voltages  near  AV^iiiey  The  “dwell”  time  spent  by  electrons  in  the  well  increases 
monotonically  up  to  voltages  near  AVuaiiey,  and  then  decreases  rapidly.  The  macroscopic 
QHD  dwell  time  differs  qualitatively  from  dwell  times  based  on  microscopic  quantum 
calculations,  which  predict  that  the  dwell  time  is  maximum  at  resonance. 

Since  the  QHD  equations  have  the  same  form  as  the  classical  fluid  dynamical  equa¬ 
tions,  well-understood  classical  bound2try  conditions  can  be  applied  in  simulating  quantum 
devices.  Moreover,  the  QHD  equations  are  expressed  in  terms  of  the  fluid  dynamical  quan¬ 
tities  density,  velocity,  and  temperature.  These  classical  fluid  dynamical  concepts  enable 
us  to  interpret  electron  behavior  in  quantum  devices  in  a  physically  intuitive  way.  We 
can  define  the  time  spent  by  an  electron  in  the  quantum  well  or  the  electron  temperature 
throughout  the  device  in  a  precise  manner.  In  turn,  the  intuitive  understanding  developed 
through  the  QHD  model  sheds  light  on  more  fully  quantum  mechanical  descriptions  of 
electron  behavior  in  quantum  devices. 
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Figure  5:  Log{Density/10*®  cm"®].  The  curves  are  for  AV  =  0.097  (gray),  0.191  (dark 
gray),  and  0.22  (black)  volts. 


Figure  6:  Time  in  picoseconds  (dots)  spent  by  electrons  in  the  quantum  well  vs.  voltage.  I 

Fot  reference,  a  sc^ed  version  of  the  current  density  (solid  line)  is  also  shown.  d 
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Abstract 

Computer  ainnilation  is  an  indispensabfo  tool  for  the  of  new  VLSI  devioes.  Be> 

sides  the  progress  in  physical  modds  and  compntational  techniques  which  has  dominated 
tedtnohgy  computer  aided  design  (TCAO)  in  the  past,  purely  scrftware-related  aspects  are 
attracting  increasing  attenti<m  and  will  beomne  a  crucial  issue  far  future  devdopments  in 
this  field. 

I.  Introduction 

Process  and  device  simulation  is  commonly  used  for  the  design  of  new  VLSI  devioes  and 
processes  and  as  an  explorative  tool  to  gain  a  better  understanding  of  process  and  device 
physics.  On  the  other  hand,  simulation  is  also  carried  out  after  the  design  phase  in  order 
to  optimize  certain  parameters  of  a  technology,  e.g.,  to  inq>rove  device  performance  and 
reliability  or  to  increase  the  yield  [1]. 

For  all  these  tasks  the  term  TCAD,  short  for  teehnohgg  eompnter  aided  design,  was  bom. 
TCAD  includes  both  physically  rigorous  and  Amplified  process  and  device  simulation  in 
one  to  three  spatial  dimensions.  FWthermcxe,  links  to  layout-<»iented  CAD  and  to  dicnit 
simulation  are  required. 

Depending  on  the  particular  iq>plicati<Hi  of  TCAD  tools,  different  demands  arise;  for  the 
development  of  new  technologies  and  for  the  prediction  of  the  behavior  of  new  devices  both 
accuracy  and  robustness  are  required.  In  this  case,  very  sophisticated  physical  models  and 
numerical  techniques  must  be  used,  usually  at  a  high  computational  cost.  An  example  of 
such  a  model  is  the  two-dimensional  simnlation  of  the  transient-enhanced  diffusion  during 
riq>id  thermal  annealing  [2]. 

On  the  other  hand,  for  statistical  simulations  [3]  or  post-design  process  optimizations 
[4],  speed  b  the  most  crudal  issue,  as  physical  modeb  can  be  calibrated  to  an  existing 
manufacturing  process  and  hence  do  not  pose  a  reliability  problem. 

Independently  ci  the  progress  in  advanced  physical  modeling,  the  fast  and  simple  "tuned” 
models  will  still  remain  in  broad  use;  there  is  no  unique  "best  model”  for  all  simnlation 
problems. 

TCAD  invdves  a  number  of  scientific  disciplines  in  addition  to  electrical  en^pneering  and 
computer  sdence.  This  has  also  had  an  impact  on  the  properties  of  the  software  which 
has  been  produced  by  that  heterogenous  community  during  the  past  20  years. 
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n.  frameworks 


For  a  Inng  time  the  importance  of  pure  software  issues  for  TOAD  has  been  underestimated. 
In  the  past  few  years,  as  these  issues  attract  increasing  attention,  the  major  focus  is  on 
the  integration  of  TCAD  tools  into  a  common  framework. 

n.l  TCAD  versus  EC  AD 

In  the  electronic  CAD  (ECAD)  field,  frameworks  for  tool  integration  and  data  interchange 
standards  have  been  emerging  during  the  past  years.  Hardly  any  satisfying  framework 
iTnplwmentAtinnn  are  available  for  TCAD,  especially  for  advanced,  multi-dimensional  pro¬ 
cess  and  device  simulation.  One  reason  for  that  is  the  difference  in  the  sizes  of  the  user 
communities.  Electronic  CAD  is  more  widely  used  than  Technology  CAD. 


Possible  Data  Representation  Design  Abstraction  Level 


:  VHDL 

} . 

;  Logic  Diagram 

;  CtiouR  Diagram 
•  Paramotara,  Cell  Layout 
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Figure  1:  Design  levels  and  device  count  scale  in  ECAD  and  TCAD 

Another  possible  reason  for  the  differences  between  ECAD  and  TCAD  in  terms  of  progress 
in  frameworks  and  standardization  is  shown  in  Figure  1:  in  ECAD  there  have  always  been 


38 


several  clearly  defined  layers  of  design  abstraction.  Though  the  device  count  scale  is  open 
towards  the  high  end,  there  is  a  well-defined  lower  bound  for  ECAD,  which  is  the  single 
device.  For  every  order  of  magnitude  in  device  count  there  is  a  common  abstractimi  level 
and  at  leeut  one  well-established  data  representation  as  welL  For  TCAD  however,  the 
only  evident  lower  boundary  in  terms  of  abstraction  is  the  physical  atom.  There  are 
no  “natural”  intermediate  abstraction  levels  and  as  an  indirect  consequence  of  physical 
modeling  there  is  no  well-established  unique  data  representation  like  in  the  ECAD  case. 

These  considerations  indicate  that  finding  a  unified  data  representation  for  TCAD  is 
dominated  by  semantic  problems  which  are  closely  related  to  the  large  interval  (Figure  1) 
of  device  count  to  be  represented,  together  with  the  lack  of  clearly  defined  intermediate 
abstraction  levels  and  the  multi-disciplinary  background  involved  in  TCAD 

£1.2  Framework  Demands 

As  am  operating  environment  for  TCAD  tools  and  engineers,  a  TCAD  framework  must 
provide  the  following  key  features; 

•  allow  minimum  effort  integration  of  existing  tools  and  faunlitate  the  development  of 
new  tools 

•  allow  casual  users  to  use  simulation  in  a  blaudc  box  manner 

•  provide  enough  flexibility  on  the  task  level  to  accomodate  easily  to  new  design  trsks 

•  provide  am  extensible  database  for  design  representation 

•  be  “open”  in  terms  of  platform  independence,  availaibility  and  the  use  of  open 
standards 

•  provide  standaird  functionality  like  visuadizattion,  interactive  structure  editing  and 
postprocessing  as  generic  tools 

n.3  Existing  Approaches 

Facing  these  rather  rigorous  demamds  and  the  potentiaJ  problems  stated  above,  one  camnot 
expect  to  find  am  easy  and  fast  path  leawling  to  the  ultimate  TCAD  fraunework.  However, 
various  attempts  to  head  in  that  direction  may  be  found: 

In  the  semiconductor  industry  there  have  been  emerging  a  number  of  remarkable  frame¬ 
work  efforts  worldwide,  such  as  am  integrated  system  for  stadisticad  VLSI  design  [5]  (Hi¬ 
tachi,  Japan),  an  integrated,  graphical  device  design  environment  [6]  (Phillips,  UK),  SAT¬ 
URN  [7]  (Siemens,  Germamy)  or  the  MECCA  system  [8]  (AT&T,  USA). 

Baised  on  am  initiad  proposal  by  Duvall  [9],  there  have  been  various  Profile  Interchange 
Format  (PIF)-batfed  design  environments  both  in  industry  (like  the  PRIDE  [10]  system) 
and  at  universities  (e.g.  PROSE  [11]  from  UC  Berkely). 


Commercial  TOAD  vendors  are  int^rating  thdr  tools  and  providing  them  with  unified 
uaet  interfaces,  like  STUDIO  from  Techncdogy  Modeling  Associates,  or  MASTEIRPIECE 
frtnn  Silvaco  Data  Systems. 

Depending  on  the  intoitions  of  the  creator  of  the  frameworks,  different  aspects,  like  a 
rigorous  task  level  implementation  or  a  comfortable  user  interface  [12],  have  been  empha¬ 
sized.  Unfortunately,  this  has  often  been  done  at  the  mq>en8e  of  portability,  or  by  leaving 
out  a  unified  data  representation. 

Recently,  a  client-server  framework  architecture  has  been  introduced  by  the  semiconduc¬ 
tor  wafer  representation  working  group  [13]  of  the  CAD  F^mewoiic  Initiative  (CFI),  an 
international  standardization  committee  for  ECAD.  The  intriguing  goal  of  this  approach  is 
to  separate  the  physical  modeling  completely  from  tedious  tasks  such  as  grid  generation, 
interpolation,  or  geometry  handling  by  providing  these  functions  as  a  black-box  server 
which  is  accessed  by  the  simulation  clients  via  a  procedmal  interface.  This  method  is 
very  well-suited  for  the  simulation  of  topography  formation,  however,  it  can  be  detrim«i- 
tal  to  applications  with  high  data  throughput  or  implications  which  exhibit  performance 
advantages  thanks  to  a  tight  coupling  between  physical  models  and  numerical  techniques. 

In  an  attempt  to  address  all  of  the  framework  demands  stated  above,  we  have  developed 
VISTA,  the  Viennese  Integrated  System  for  TCAD  Applications.  It  consists  of  a  PIF 
Database,  which  is  an  enhanced  intertool  version  of  the  well-known  profile  interchange 
format  proposed  in  [9].  To  accomodate  the  needs  of  existing  TCAD  applications,  the 
original  PIF  83mtax  was  restructured  by  reducing  the  number  of  different  constructs, 
adding  a  few  new  constructs  such  as  tensor  product  grid  definition,  and  by  defining 
additional  semantic  rules  for  the  use  of  standardized  attributes.  Our  PIF  implementation 
can  be  used  to  store  arbitrary  LISP  expressions  for  process  flow  representation. 

Simulators  and  all  other  tools  access  the  PIF  database  using  the  PIF  Application  Interface 
(PAI)  [14],  which  supports  several  programming  languages  including  C,  FORTRAN  and 
LISP.  The  PAI  is  a  procedural  interface  for  accessing  the  binary  PIF  database.  It  provides 
fimctionality  for  creating,  reading  and  modifying  PIF  objects.  In  this  way  the  application 
programmer  does  not  need  to  know  too  much  about  the  PIF  syntax  to  be  able  to  use 
the  PIF  database.  The  PAI  was  designed  as  a  strictly  layered  product  to  guarantee  the 
necessary  functionality,  performance  and  extensibility. 

A  system  layer  hides  all  system  dependencies  concerning  communication  with  the  operat¬ 
ing  system  from  the  rest  of  the  PAI.  A  caching  layer  takes  care  of  performance  and  space 
requirements.  The  interface  layer  allows  access  to  the  PIF  objects  suited  for  advanced 
C  and  is  the  standardized  interface  to  the  PIF  database.  The  application  layer  provides 
a  more  comfortable  access  to  PIF  objects  for  applications  written  in  C,  FORTRAN  or 
LISP. 

For  the  conversion  from  the  binary  intertool  form  to  the  intersite  ASCII  format  the  PIF 
binary  file  manager  (PBFM)  has  been  developed.  In  this  way  the  intersite  exchange  of 
PIF  files  between  different  hardware  and  software  platforms,  for  instance  via  electronic 
mail,  is  supported. 

The  simulators  are  controlled  by  an  inteq>retmg  TCAD  shell  [15]  which  integrates  all 
system  compiments  on  the  task  leveL  The  major  benefits  of  selecting  LISP  as  extension 


language  are  that  full-fledged  programming  language  features  like  branches,  loops  and 
subprograms,  mechanisms  for  defining  new  variables  and  and  standard  mathematical 
operations  and  expressions  can  be  utilized  to  define  complex  development  tasks.  The 
XLISP  [16]  interpreter  which  we  have  chosen  as  basis  for  the  TCAD  shell,  is  a  publicly 
available  software  product  written  in  portable  C,  and  has  the  additional  feature  of  bong 
highly  system-independent. 

The  user  interface  of  VISTA  [17]  is  baaed  on  the  X  Window  sjrstem.  By  means  of  the  X 
Toolkit  [18]  it  implements  most  of  the  functionality  which  is  required  to  support  TCAD 
information  flow  as  so-called  widgets.  The  user  interface  is  tightly  coupled  with  the  XLISP 
interpreter.  This  combination  of  a  widget-oriented  user  interface  with  w  interpreter  is 
known  to  be  a  very  flexible  and  promising  concept  [19],  [20],  [21]. 

An  interactive  device  editor,  generic  postprocessing  and  visualization  modules  are  also 
part  of  the  framework.  Special  emphasis  hu  been  put  on  the  Tise  of  open  portable  subsys¬ 
tems  (which  are  mostly  public  domain  products)  to  achieve  a  high  degree  of  portability. 
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Figure  2:  Comparison  of  source  code  sizes.  VKITA  does  not  include  simulation  tools. 

Figure  2  shows  the  code  sizes  of  well-known  simulation  toob.  SAMPLE  [12]  (from  UC 
Berkeley)  is  a  topography  simulation  tool,  MINIMOS  [22]  and  PROMIS  [2]  are  two- 
dimensional  MOS  device  and  process  simulation  programs,  respectively,  developed  at  TU 
Vienna.  SUPREM4  is  a  two-dimensional  process  simulation  program  and  PISCES  is  a 
two-dimensional  device  simulation  program,  both  from  Stanford  University  [23]. 

Given  VISTA ’s  remarkable  code  size  compared  to  these  classical  single  simulation  toob, 
it  becomes  obvious  that  a  framework  for  tool  integration  and  development  must  exhibit 
a  homogenous  architecture  in  order  to  be  comprehennble  and  maintsdnable. 

This  can  be  achieved  through  the  use  of  unified  concepts.  The  generalization  cf  >rri8ting 
solutions  within  the  framework  should  be  favoured  over  the  introduction  of  new  ;  r  es.  A 
second  key  to  a  comprehensible  system  is  the  use  of  abstract  concepts.  In  VISTA,  tedious 
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parts  of  the  sooroecode,  like  the  PAI  or  intotfsce  code  which  is  required  f<»  multilanguage 
progrmmmmg,  is  generated  automatically  bom  abstract  specifications.  Only  the  creative 
parts  are  left  to  the  programmer. 

m.  The  Future 

It  is  a  fact  that  the  rapidly  Bhrinlting  device  dimensions  and  the  increasing  cost  of  ex¬ 
perimental  design  optimization  have  been  stimulating  the  evolution  of  TOAD;  it  can  be 
expected  that  TOAD  will  continue  to  evolve  into  several  directions: 

The  exact  future  development  of  physical  models  is  hard  to  predict,  as  models  always 
reflect  the  state  of  technology  [24],  [25].  FHiture  models  will  have  to  be  based  on  a  lower 
level  ci  abstraction.  As  a  consequence  the  complexity  and  the  “bandwith”  of  modeb  used 
for  TOAD  will  increase.  As  to  computational  techniques,  it  is  obvious  that  the  availability 
of  massively  parallel  computers  will  stimulate  the  development  and  use  of  parallelizable 
methods,  like  Monte  Carlo  simulation,  e.g.  [26]. 

Besides  the  progress  in  physical  modeling,  the  future  of  process  and  device  simulation  will 
be  significantly  influenced  by  the  introduction  of  TOAD  frameworks  and  its  impacts.  This 
integration  will  allow  simulation  to  catch  up  with  the  physical  reality.  It  is  to  be  hoped 
that  TCAD  frameworks  will  help  to  reduce  the  giq>  between  the  engineer’s  simulation 
needs  and  the  sophistication  of  the  avidlable  models  and  simulation  methods  by  providing 
a  ’^lug-and-play”  environment  for  tool  developers.  One  major  requirement  towards  this 
goad  is  the  semantic  standardization  for  TCAD  data,  which  concerns  describing  process 
information  in  a  comprehensive  and  unambiguous  way  rather  than  finding  appropriate 
representation  methods.  Client-server  concepts  can  be  expected  to  be  used  in  future 
simulation  tools  in  different  places,  e.g.  for  solving  large  linear  systems. 

A  new  object-oriented  method  for  CAD  tool  management  has  been  demonstrated  through 
the  Cadwell  design  framework  [27].  We  believe  that  a  nmilar  method  of  tool  abstraction 
should  be  employed  in  a  framework  for  advanced  process  and  device  simulation  as  well. 

The  importance  of  process  and  device  simulation  will  increase  in  general,  and  as  a  con¬ 
sequence  of  the  broader  use,  performance,  tf^ther  with  robustness  and  ease  of  use  will 
become  even  more  crucial.  The  integration  of  different  process  and  device  simulators  into 
modem  TCAD  systems  will  become  a  necessity  because  of  the  inherent  communication, 
data  transfer  and  maintainance  advantages. 
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Abstract 

In  this  paper  we  present  a  new  finite  element  approach  for  the  simulation  of  recessed  T- 
gate  M^I%Ts  and  H^MTs  based  on  quadrilateral  finite  elements.  The  discretization  of 
die  current  continuity  equation,  which  is  the  crucial  part  of  the  simulation,  is  followed  in 

details.  Two  simulation  examples  •  a  200  nm  gate  length  MESFET  and  a  5-doped 
pseudomorphic  HEMT  illustrates  the  usefulness  of  the  adopted  approach. 


I.  Introduction 

The  performance  of  the  modem  nanometer-scale  MESFETs  and  HEMTs  becomes  strongly 
affected  by  device  parasitics  such  as  coupling  capacitances  and  access  resistances  [1].  In 
recessed  gate  devices  these  parasitics  are  critictdiy  affected  by  the  shape  and  surface  condition 
of  the  recess  region.  In  addition  the  T-gate  process  designed  to  reduce  the  gate  series  resistance 
[2]  may  also  reinforce  the  parasitic  capacitances.  Although  Hydrodynamic  [3]  and  Monte  Cario 
[4]  simulation  programs  are  making  significant  progress  in  properly  describing  the  non¬ 
equilibrium  transport  phenomena  in  compound  reTs,  the  real  shape  of  the  gate  recess  is 
generally  poorly  modelled,  assuming  planar  or  rectangular  simulation  domains.  Surface  effects 
are  also  either  neglected  or  modelled  by  fixing  the  surface  potential  or  by  increasing  the  surface 
doping.  Yet  it  is  well  known  that  this  effects  can  in  many  cases  have  a  more  profound  impact 
on  device  DC  characteristics  and  high  frequency  performance  than  the  transport  details  in  the 
'intrinsic'  region  under  the  gate. 

In  this  paper  we  describe  the  implementation  of  a  fmite  element  approach  based  on  quadrilateral 
finite  elements  for  a  precise  description  of  the  device's  geometry  and  the  realistic  inclusion  of 
the  surface  effects  in  the  simulation  of  recessed  gate  MESFETs  and  HEMTs.  Several  simulation 
examples  illustrate  the  work  of  the  developed  on  this  basis  Heterojunction  2D  Finite  element 
(H2F)  simulator. 


II.  Model  Description  and  Discretization 

The  heterojunction  compound  semiconductor  device  model  equations  used  in  the  current 
version  of  H2F  for  steady  state  simulation  include  Poisson's  equation  and  electron  current 
continuity  equation  in  a  drrft  diffusion  ap(n'oximation 


V.[£(r)Vv^]  =  -p(F)  (1) 

V.J.=0  (2) 

7,  =  V  %  +  eOyn  (3) 
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(4) 


where  all  symbols  have  their  usual  meaning  While  this  approach  is  unable  to  describe  precisely 
the  device's  transport,  in  many  cases  it  is  justified  by  the  need  to  accurately  predict  the  device 
parasitics. 

A  great  deal  of  attention  has  been  paid  to  the  proper  handling  of  the  surface  effects  in  the 
simulation.  The  charge  term  in  the  Poisson's  equatitm  is  given  by 

p(r )  =  q(Nl^  +  +p-n)  (5) 

Where  and  represent  charged  bulk  and  interface  states  respectively.  The  symbol 

S{1)  indicates  that  Nl^  is  surface  charge  which  may  be  places  along  any  (in  the  general  case 
curvilinear)  interface  in  the  device.  Similar  to  the  bulk  model  the  generalised  interface  trap 
model  includes  acceptor  and  donor  like  states  with  an  arbitrary  energy  position  whose 
occupation  depends  on  the  quasi-Fermi  level  and  the  potential  variations.  Hie  simulation 
domain  includes  the  space  above  the  semiconductor  surface  providing  a  proper  interaction 
between  the  charge  on  the  surface  states  and  the  spreading  surface  potent^. 

Quadrilateral  finite  elements  have  been  used  for  the  discretization.  The  grid  is  generated  by 
appropriate  deformation  of  originally  rectangular  sub  domains.  Fig.  1 a,b  illustrates  this 
procedure  for  the  gap  between  the  gate  and  the  cap  layer  of  a  recessed  gate  FET 


(a)  (b) 

Fig.  1  Generation  of  the  quadrilateral  grid  in  the  recess  gate  region,  (a)  the  initial  rectangular 
grid  (b)  the  quadrilateral  grid  after  suitable  deformation 

The  Galerldn  fmite  element  method  has  been  adapted  to  solve  the  Poisson's  equation.  The 
correqxmding  integration  over  the  quadrilateral  elements  during  the  discretization  was  carried 
out  by  a  linear  isr^iarametric  mapping  (Hg.  3)  A  control  volume  method  has  bwn  adopted  [5] 
for  the  discretization  of  the  current-continuity  equation  (Fig.  4).  In  this  approach  each 
quadrilateral  element  is  divided  into  four  subelements  and  the  discretization  is  carried  out 
balancing  the  current  flowing  in  and  out  of  the  subelements  attached  to  a  given  condensation 
point  Thus  for  vertex  1  (See  Rg.  4)  l4j  -  l2j  =0.  A  central  point  integration  is  used  to  calculate 
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the  current  through  each  of  the  subelement  sides.  For  example  where  the  current 

dotsity  Jj2  is  approximated  by  the  standard  Gummel  expression. 

The  growth  functions  involved  in  the  derivation  of  this  expression  are  also  used  for 
interpolation  of  the  electron  concentration  along  the  sides  of  the  element  The  electron 
concentration  in  point  A  for  example  is  given  by 


It  has  been  found  that  this  discretization  is  stable  for  arbitrary  shapes  of  the  quadrilateral 
elements  and  does  not  leads  to  the  spikes  typical  for  obtuse  triangles.  The  same  approach  may 
be  used  for  discretization  of  the  momentum  and  energy  conservation  equation  in  the  quasi¬ 
hydrodynamic  transport  treatment. 


Fig.  2  Linear  isoparametric  mapping  Fig.  3  Discretization  of  the  current  continuity 

equation 


The  grid  generation  preserves  the  number  of  grid  points  in  lateral  and  vertical  directions  and 
leads  to  a  regular  nine  diagonal  matrix  of  the  discretized  equations.  A  Fast  Incomplete  LU 
Factorisation  Biconjugate  Gradients  (ILUBCG)  solver  is  usi^  for  the  numerically  intensive 
iterations.  The  solution  of  the  Poisson's  equation  involves  only  a  few  biconjugate  gradient 
steps  per  Newton  iteration  that  significantly  reduces  the  total  computation  time.  The 
convergence  problems  related  to  the  strongly  localised,  potential  dependent  interface  charge 
have  b^n  resolved  by  appropriate  dumping.  ILUBCG  ^so  solves  without  complication  the 
discretized  current  continuity  ^uadon. 

Although  H2F  is  'serial'  code,  by  using  a  pipeline  flleserver,  muldple  copies  of  the  program 
can  be  run  concurrently  on  MIMD  machines  (in  our  case  a  Parsytec  Model  64  transputer 
system),  calculating  in  parallel,  separate  sets  of  input  device  parameters.  This  extends 
dramatically  the  capability  of  the  simulator  for  real  design  work  such  as  structure  optimisation, 
sensitivity  analysis  and  yidd  prediction  where  several  hundred  simulations  are  oftmi  carried  out 
during  single  investigation. 
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111.  Examples 

Two  simulation  examples  illustrate  the  af^lication  of  the  developed  quadrilateral  finite  element 
a{^proach  for  the  simulation  of  complex  shaped  devices.  The  first  example  is  concerned  with  ^ 
simulation  of  a  200  nm  gate-length  state  of  the  art  MESFET,  fabricated  in  the  Nanoelectronics 
Research  Centre  at  the  University  of  Glasgow  [6].  The  flexibility  of  the  quadrilateral  grid  is 
illustrated  in  Ing.  4  where  the  cross  sectional  photograph  of  the  device  is  compared  with  the 
cone^nding  H2F  simulation  domain. 


(a) 


(b) 


(c) 


Fig.  4  Simulation  of  a  200  nm  gate  length  MESFET.  (a)  Cross  sectional  SEM  view  (b)  the 
corresponding  finite  element  grid  (c)  potential  distribution  at  Vg=-0.4V  and  Vd=2.5  V 


Fig.  S  Simulated  and  measured  1d*Vd  characteristics  of  the  device  illustrated  in  Fig.  4.  (a) 
influence  of  acceptor  type  surface  states  position  (b)  influence  of  the  surface  state 
(tensity 

The  influence  of  the  position  and  the  density  of  these  surface  states  on  the  device's  1d~Vg 
characteristics  for  the  MESFET  shown  in  Fig.  4  is  given  in  Fig  5  a,b.  Our  experimental 
measurements  are  in  good  agreement  with  the  expected  position  and  states  density  Ps=0.6  eV 
and  Nit=2el2  cm*^. 

The  second  example  is  the  simulation  of  a  delta-doped  pseudomorphic  HEMT  whose  behaviour 
is  significantly  influenced  by  the  sertes  resistances  introduced  by  an  undoped  cap  layer  (Fig.  6). 
Although  the  drift  diffusion  approach  underestimates  the  current,  it  has  been  found  that  by 
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adjusdog  the  Mtimtion  velocity  in  the  mobility  model  (to  1.4x10^  cm/s  in  this  case)  the 
measured  characteristics  cm  be  accqitably  matched. 


X,  cm  xlE-5  Y 

(a)  (b) 


Fig.  6  Simulation  of  200  nm  gate  length  pseudomorphic  HEMT  (a)  device  structure  and 
potential  distribution  at  Vg=3.5  V  and  Vd=2.5  V  (0  correspond  to  -4  V  and  the 
increment  is  0.5  V)  (b)  measured  (lines)  and  calculated  (symbols)  Id  -  Vq 
characteristics 


IV.  Conclusions 

The  developed  quadrilateral  Hnite  element  approach  provides  the  necessary  flexibility  for  the 
proper  hant^g  of  shape  and  surface  associate  parasitic  effects  in  the  modem  recessed  T-gate 
FETs.  In  the  example  simulations,  where  device  parasitics  play  an  important  role,  the 
implemented  drift-diffusion  approach  leads  to  a  reasonable  prediction  of  the  device  behaviour 
widi  a  simple  tuning  of  the  saturated  carrier  velocity.  However,  the  proposed  finite  element 
scheme  may  be  extended  to  incorporate  the  hydrodynamic  approach  in  the  same  manner  as  the 
finite  (hfretence  case. 


References 

[1]  P.H,  Ladbroke,  A.J.  Hill  and  J.P.  Bridge,  "Fast  FET  and  HEMT  Solvers  for 
Microwave  CAD",  J.  Microwave  and  Millimetre-Wave  Computer  Aided  Eng.,  Vol.  3, 
No.  1.  pp.37-60.  1993. 

[2]  P.C.  Chao.  P.M.  Smit,  S.C.  Plamaeteer,  and  J.C.M.  Hwang,  "Electron  Beam 
Fabrication  of  GaAs  Low-Noise  MESFETs  Using  a  New  TriLauer  Resist 
Technique",  IEEE  Trans.  Electron  Dev.,  Vol.  ED-32,  No.  6,  pp.1042-1045, 1985. 

[3]  J.-R.  ^ou  and  D.K.  Ferry,  "Simulation  of  Ultra-Small  GaAs  MESFET  Using 
Quantum  Moment  Equations",  IEEE  Trans.  Electron  Dev.,  Vol  39,  No.  3,  pp.473- 
478,  1992. 

[4]  LC.  Kizilyalli,  M.  Artaki  and  A.  Chandra,  "Monte  Carlo  Study  of  GaAs/AlxGai.xAs 
MODFETs:  llie  Effect  of  AlxGai.xAs  composition",  IEEE  Trans.  Electron  ^v.,  Vol. 
38,  No.  2,  pp.197-206.  1991. 

[5]  A.  Asenov  and  E.  Stefanov,  "IMPEDANCE  2.0:  A  Flexible  Concept  for  Process  and 
Device  Simulation”,  Proc.  ISPPM,  pp.272-286,  Vama  1989. 

[6]  N.I.  Cameron.  G.  Hopkins,  LG.  Thayne,  S.P.  Beaumont,  C.D.W.  Wilkinson,  M. 
Holland,  A.H.  Keanand  and  C.R.  Stanley,  "Selective  Reactive  Ion  Etching  of 
GaAs/AK3aAs  Metal-Semiconductor  Held  Efl^  Transistors",  J.  Vac  Sci.  TedinoL  B, 
VoL  9  p.3538, 1991. 


Numerical  Simulation  of  IGBTs  at  Elevated  Temperatures 

Andrew  R.  Brown Asen  Asenovt,  John  R.  Barker^,  Steve  Jones*  and  I^ter  Waind* 

tDepartment  of  Electronics  and  Electrical  Engineering, 

University  of  Glasgow,  Glasgow  G12  8QQ,  UK 

*GEC  Plessey  Semiconductors, 

Caiholme  Road,  Lincoln  LNl  ISG,  UK 


Abstract 

In  this  paper  we  present  a  2D  flnite  difference  device  simulator  which  has  been  developed  to 
underpin  the  design  and  optimisation  of  power  IGBTs.  The  simulator  has  been  used  to 
investigate  the  effect  of  elevated  temperatures  on  the  performance  of  the  IGBT  for  use  in 
environments  where  ambient  temperatures  up  to  200°C  are  likely.  The  preliminary  results  of 
this  investigation  are  reported  here. 


I.  Introduction 

The  Insulated  Gate  Bipolar  Transistor  (IGBT)  (Fig.l)  is  at  present  one  of  the  most  widely  used 
power  semiconductor  devices  [1].  Its  popularity  among  the  designers  of  power  electronics 
systems  is  due  to  the  lucky  combination  of  a  low  forward  voltage  drop,  typic^  for  power  BJTs, 
and  gate  controlled  turn-off  and  high  switching  speed  characteristic  of  power  MOSFETs  [2].  At 
the  same  time  the  coupled  field  effect  and  bipolar  actions  make  the  device  operation  quite 
complicated  [3]. 

PoVsitioon  Matai  IGBT  Cathode 


Fig  1.  IGBT  structure  Fig  2.  IGBT  cell  structure  and  simulation 

domain 

The  deviM  tehayiour  is  singly  affe^  by  the  cell  geometry,  MOSFET  channel  length,  actual 
doping  distribution  and  lifetime  killing  treatments.  The  traditional  empirical  approach  to  the 
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device  design  is  inadequate  and  impractical.  It  is  widely  recognised  that  the  device  design  and 
optimisation  should  rely  on  a  proper  2D  or  even  3D  numeric^  simulation  [4],  Although  many 
general  purpt^  simulation  packages  are  now  available,  they  are  not  usually  optimised  for  tte 
simulation  of  IGBTs  and  frequently  do  this  job  slowly  and  unreliably. 

Here  we  report  on  a  new  power  semiconductor  device  simulator  developed  to  underpin  the 
design  and  the  optimisation  of  IGBTs.  The  simulator  have  been  applied  to  study  the  behaviour 
of  relatively  short  channel  IGBTs  at  high  temperatures  up  to  2(X)°C.  Such  high  temperature 
operating  conditions  are  typical  for  the  application  of  these  devices  in  aircraft  power  conversion 
systems. 


II.  The  Simulation  Program 

Our  steady  state  IGBT  simuator  is  based  on  the  solution  of  the  Poisson  equation  and  current 
continuity  equations  for  electrons  and  holes.  Heavy  doping  effects  are  included  through 
bandgap  and  electron  affinity  variation  in  a  manner  similar  to  those  used  for  the  simulation  of 
compound  semiconductor  devices.  A  finite  difference  method  is  adopted  for  discretisation. 
Although  it  is  well  known  that  the  global  Newton  procedure  provides  better  convergence  in  the 
case  of  strongly  coupled  equations,  a  modified  Gummel-like  iterative  scheme  is  used  for 
nonlinear  iterations  because  it  provides  a  simpler  way  towards  the  parallelisation  of  the 
simulation  code  which  will  be  the  next  step  of  this  development.  An  appropriate  logarithmic 
dumping  for  the  Poisson  equation  in  combination  with  a  bounded  change  in  electron  and  hole 
concentrations  [S]  ensure  convergence  in  the  whole  dynamic  range  of  applied  on-state  voltages 
up  to  latch-up  conditions.  In  reverse  bias  mode  the  logarithmic  dumping  provides  convergence 
for  the  Poisson  equation  up  to  several  thousand  volts.  A  fast  Incomplete  LU  Factorisation 
Biconjugate  Gradient  (ILUBCG)  solver  is  employed  for  the  solution  of  both  Poisson  and 
current  continuity  equations.  The  solution  time  of  the  three  nonlinear  equations  in  the  Gummel 
cycle  is  significantly  reduced  if  only  a  few  ILUBCG  steps  follow  each  nonlinear  Newton  like 
step. 

In  order  to  simulate  properly  the  high  temperature  IGBT  behaviour  appropriate  semi-empirical 
expressions  are  included  for  all  relevant  silicon  parameters.  The  temperature  dependent  mobility 
model  of  Nishida  and  Sah  [6],  which  includes  the  scattering  mechanisms  of  surface  and  bulk 
acoustical  and  optical-intervalley  phonons,  bulk  ionised  impurities,  oxide  charges,  surface 
roughness  and  dipoles  or  neutral  surface  states,  was  modified  to  account  for  the  carrier-carrier 
scattering  on  high  injection  cases.  The  recombination  terms  consist  of  two  components:  a 
Shockley-Read-Hall  term  and  an  Auger  recombination  term.  Although  slight  temperature 
dependence  of  the  electron  and  hole  lifetimes  and  Auger  coefficient  may  be  expected,  due  to  the 
lack  of  relevant  data  we  assume  in  our  analysis  that  these  parameters  are  temperature 
independent 

Fig.2.  shows  an  octagonal  IGBT  cell  structure  with  our  solution  domain  which  is  a  cross 
section  along  the  line  A-A'  and  which  takes  into  account  the  symmetry  between  two 
neighbouring  cells.  It  is  clear  however  that  the  2D  simulation  is  quite  a  rough  approximation  to 
the  actual  3D  nature  of  this  device  and  cannot  represent  cell  interaction  totally  accurately,  for 
example  the  influence  of  the  distributed  parasitic  vertical  JFET.  It  is  planned  to  develop  a  3D 
device  modeling  program  implemented  on  a  Parsytec  parallel  transputer  system  in  order  to 
investigate  the  effects  due  to  the  inherent  3D  nature  of  the  device.  Additional  uncertainty  in  the 
results  of  the  simulation  is  introduced  by  the  approximation  for  the  lateral  distribution  of  the 
impurities  in  the  MOSFET  channel.  For  a  given  vertical  doping  profile  (measured  using 
spreading  resistance  method),  experimental  results  have  shown  that  different  assumptions  for 
the  lateral  impurity  distribution  can  have  a  marked  effect  on  the  subthreshold  MOSFET 
characteristics  and  the  threshold  voltage.  It  is  therefore  of  vital  importance  when  modeling  an 
actual  device  to  have  an  accurate  2D  and  in  future  3D  description  of  the  doping  profile, 
particularly  of  the  channel  region,  if  the  results  are  to  give  a  reliable  model  of  the  device 
characteristics. 


51 


Ill  Results 


Three  typical  Id^Vd  curves  calculated  at  room  temperature,  100°C  and  200°C  are  given  in  Fig.3 
for  Vg=15V  which  is  the  expected  operating  gate  voltage  for  the  device.  Fig.4.  shows  a  plot  of 
Id  vs  Vq  for  room  temperature  and  200°C  at  a  drain  voltage  Vd=2V.  Both  a  logarithmic  plot 
(solid  lines,  left  axis)  and  a  linear  plot  (dotted  lines,  right  axis)  are  shown  on  the  same  graph. 
The  potential,  electron  and  hole  distributions  (Vg=15V,  Vd=2V)  for  the  top  region  of  the 
device  (to  a  depth  of  ISpm)  at  room  temperature  and  at  200°C  are  presented  in  Fig.5(a,b,c)  and 
Fig.6(a,b,c)  respectively.  Equipotentials  are  at  -0.15V  intervals  while  equi-concentration 
contours  show  the  power  of  ten  of  the  concentration,  i.e.  contour  no.  16  represents  lO^^m'^ 


Fig.3.  Id-Vd  characteristics  at  different  Fig.4.  Id-Vg  characteristics  at  different 
temperatures  (Vg=I5V)  temperatures  (Vd=2V) 


The  high  temperature  has  two  main  effects  on  the  IGBT  characteristics.  Firstly  the  on-state 
current  is  reduced  by  over  50%  as  the  temperature  increases  to  200°C  as  can  be  seen  clearly 
from  the  Id-Vd  plot  and  also  from  the  1d-Vd  plot  for  voltages  above  the  threshold  voltage.  This 
means  a  significant  increase  in  the  on-state  resistance  and  on-state  power  losses.  The  overall 
current  decrease  at  2(X)°C  is  mainly  due  to  the  mobility  reduction  that  decreases  the  MOSFET 
current  and  at  the  same  time  increases  the  resistance  of  the  drift  region.  The  larger  voltage  drop 
across  the  drift  region  at  200°C,  as  a  result  of  higher  resistance,  is  clearly  seen  in  Fig.6(a).  As 
can  be  seen  from  the  comparison  of  the  electron  concentrations  in  Fig.5(b)  and  Fig.6(b)  the 
carrier  concentrations  in  the  drift  region  in  conductivity  modulation  mode  are  almost  the  same 
(approximately  10*^  cm'^). 

The  Id-Vg  characteristic  reflects  the  MOSFET  action  of  the  IGBT  gate  and  as  such  exhibits  the 
kind  of  temperature  dependence  that  is  expected  for  a  MOSFET  [7,8,9].  This  is  a  reduction  in 
the  current  for  voltages  above  the  threshold  associated  with  the  drop  in  mobility,  the  lowering 
of  the  threshold  voltage,  and  a  decrease  in  the  sub-threshold  slope. 

The  second  important  effect  at  elevated  temperature  is  the  decrease  in  the  static  latch-up  current 
associated  with  the  turn  on  of  the  parasitic  thyristor,  which  exists  in  the  device  structure  [10].  In 


a)  Potential  Distribution 


b)  Election  Concentration  c)  Hole  Concratration 


Fig.  5  Potential  and  carrier  concentration  distributions  at  room  temperature  (Vg=15V,  Vd=2V) 


a)  Potential  Distribution 


b)  Electron  Concentration  c)  Hole  Concentration 


Fig.  6  Potential  and  carrier  concentration  distributions  at  200°C  (Vg=15V,  Vd=2V) 


a)  Potmtial  Distribution 


b)  Electron  Concentration  c)  Hole  Concentration 


Fig.  7  Potential  and  carrier  concentration  distributions  at  latch-up  (Vg=15V.  Vd=3V,  T=27*C) 
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the  p-region  the  hole  current  flows  around  the  n'*'-region  and  due  to  the  resistance  of  the 
/>-iegion  a  voltage  drop  is  produced.  If  the  current  is  large  enough  then  this  potential  difference 
will  be  sufficient  to  forward  bias  the  n*-p  junction  at  the  channel  end  injecting  electrons  from 
the  n'''-region  into  the  /^-region,  turning  on  the  parasitic  thyristor  and  latching  up  the  device.  At 
VcplSV  the  static  latch-up  occurs  at  a  current  ^nsity  of  approximately  Id=1  ISOA/cm^  at  room 
temperature  and  at  lD=440A/cm2  at  200°C.  The  reduction  in  the  static  latch-up  current  is  due  to 
the  reduction  in  the  built-in  potential  of  the  n'^'-p  junction  and  to  the  increase  in  the  resistance  of 
the  p-region.  The  potential,  electron  and  hole  distribution  at  latch-up  are  give  in  Fig.7(a,b,c).  It 
is  interesting  to  note  that  the  latch-up  occurs  close  to  the  channel,  near  the  comer  of  the 
/i+-region. 


IV.  Conclusions 

A  program  has  been  developed  which  is  suitable  for  the  simulation  of  IGBTs  and  which 
includes  the  temperature  dependence  of  semiconductor  parameters.  The  study  of  the  high 
temperature  IGBTs  operation  has  shown  that  increasing  the  operating  temperature  of  the  device 
results  in  a  significant  reduction  in  on-state  drain  current  for  an  applied  drain  voltage.  The 
forward  voltage  drop  across  the  drift  region  of  the  device  also  increases,  as  a  result  of  increased 
resistivity,  which  leads  to  an  increase  in  on-state  losses.  However  this  is  accompanied  by  the 
lowering  of  the  threshold  voltage.  The  elevated  temperature  also  reduces  the  maximum  available 
drain  current  before  latch-up  of  the  device  occurs. 
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Abstract 

A  new  comprehensive  and  efficient  bipolar  junction  transistor  (BJT)  device  model  is 
presented.  This  model  self-consistently  solves  the  Boltzmann  trainsport  equation  (BTE) 
for  electrons,  the  current-continuity  equation  for  holes  and  the  Poisson  equation.  The 
calculations  provide  almost  the  same  information  as  similar  Monte  Carlo  simulations,  but 
require  only  1%  of  the  CPU  time.  In  addition,  a  new  discretization  has  been  employed 
which  facilitates  convergence. 

I.  Introduction 

In  the  past,  device  modeling  by  direct  solution  of  the  Boltzmann  transport  equation 
(BTE)  was  usually  considered  too  difficult  to  be  achieved.  The  difficulties  were  mainly 
due  to  dimensionality  problems  (the  steady-state  BTE  is  a  6-dimensional  equation),  and 
problems  evaluating  the  complicated  collision  integral. 

Recently,  however,  the  Legendre  polynomial  (LP)  technique  has  been  demonstrated 
to  provide  fast  and  accurate  solutions  to  the  BTE(1,2,3].  Use  of  the  LP  technique  al¬ 
lows  one  to  overcome  problems  of  dimensionality,  wd  also  facilitates  evaluation  of  the 
collision  integrals.  However,  until  now  the  method  was  used  only  for  very  simple,  largely 
unrealistic  test  structures[2],  or  only  as  a  post-processor  to  hydrodynamic  simulations[3]. 
Here,  we  overcome  these  limitations  and  adapt  the  new  LP  technique  to  actually  model 
BJT’s. 

n.  General  Approach 

In  the  present  work  we  self-consistently  solve  the  electron  BTE,  the  Poisson,  and 
hole-continuity  equations  to  obtain  the  electron  momentum  distribution  function  for  an 
entire  prototype  BJT.  We  first  transform  the  BTE  into  a  tractable  expression  using  a 
first  order  LP  expansion:  this  reduces  the  dimensionality  of  the  system,  and  allows  us  to 
integrate  the  collision  terms  analytically. 

To  numerically  solve  the  system,  we  first  discretize  the  BTE  using  a  new  Scharfetter- 
Gummel-like  algorithm  which  employs  special  state  variables.  After  discretizing  the 
Poisson  and  hole-continuity  equations,  the  overall  nonlinear  system  is  then  solved  using  a 
Gummel-type  iteration  scheme.  This  algorithm  provides  nearly  exponential  convergence 
to  efficiently  obtun  the  distribution  function  for  the  entire  device  without  the  statistical 
noise  which  is  characteristic  of  Monte  Carlo  methods. 


in.  The  Boltamann-PoiMon-Continuity  System 

The  device  model  consists  of  the  Poisson  equation  ( 1 ),  the  space-dependent  BTE  for 
electrons  (2),  and  the  current  continuity-equation  for  holes  (3).  The  collision  integrals  in 
the  BTE  account  for  the  effects  of  acoustic  and  intervalley  phonons,  as  well  as  impact 
ionization  scattering.  These  scattering  cross-sections,  as  well  as  nonparabolic  conduction 
band-structure  V2dues,  are  identical  to  those  employed  in  Monte  Carlo  calculations[4]. 


v;*<r)  =  -  ln(r)  -  p(r)  +  Ntit)  -  ;VD(r)J 


^Vk£.V,/(k,r)-H^V,^(r)-Vk/(k,r)  = 


3/(k,r) 

dt 


g/(k,r) 

at 


at 


•  (ppp(r)Vr^(r)  -I-  MpV; V,p(r)]  =  n,  p)  (3) 

where  n(r)  =  J  /(k,r)dk  is  the  electron  concentration;  p(r)  is  the  hole  concentra¬ 
tion;  ^(r)  is  the  potential;  Nd{t)  and  Nx(r)  are  the  doping  concentration  for  donors 
and  acceptors;  /(k,r)  is  the  distribution  function;  R  is  the  recombination  rate,  including 
impact  ionization  and  Shockley- Read- Hall  (SRH)  recombation;  Vt  =  KaT/e-,  the  sub¬ 
scripts  ac,  tv,  it  correspond  to  acoustic  phonons,  intervalley  phonons,  impact  ionization, 
respectively. 


rV.  Method  of  Solution: 

1.  BTE  Formulation:  Legendre  Expansion 

We  first  express  the  distribution  function  in  terms  of  Legendre  polynomial  basis  func¬ 
tions: 

/(k,  r)  =  /(k,  x)  =  /o(« ,  x)  -k  kg{e,  x)cos6  (4) 

where  9  is  the  angle  between  V,^(r)  and  k;  /o(£t®)  kg{e,x)  represent  the  coef¬ 
ficients  of  the  symmetrical  and  the  anti-symmetrical  parts  of  the  distribution  function 
respectively. 

In  solving  the  BTE,  our  objective  will  be  to  determine  the  unknown  coefficients  /o 
and  g.  To  determine  these  coefficients,  we  next  substitute  the  Legendre  expansion  into 
the  BTE,  and  use  symmetry  to  obtain  2  equations  for  the  2  unknowns  /o  and  g. 

2.  Numerical  Solution  of  BTE:  A  Scharfetter-Gummel-Like  Approach 

At  this  point,  the  typical  approach  would  be  to  discretize  and  try  to  solve  the  re¬ 
sulting  equations  directly.  However,  this  direct  approach  would  lead  to  a  discrete  matrix 
which  is  ill-conditioned,  would  not  readily  account  for  the  exponential  variation  in  the 
distribution  function,  and  would  inhibit  obtaining  a  solution  to  the  overall  Boltzmann- 
Poisson-Continuity  system. 

To  overcome  numerical  problems,  and  routinely  solve  the  coupled  system,  we  devel¬ 
oped  a  Scharfetter-Gummel-like  discretization  scheme  to  resolve  the  exponential  behavior 
of  the  distribution  function.  This  scheme  enhances  the  diagonal  elements  of  the  discrete 
coefficient  matrix,  and  helps  to  numerically  account  for  the  rapid  variation  in  the  dis¬ 
tribution  function.  With  this  approach,  /o(£,  x)  is  expressed  as  the  Slotboom-like  form 
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/o(ff,*)  =  n(at)^(c,i);  and  n(x)  is  given  as  njii(x)exp(^(i)/V|).  We  then  substitute 
these  new  variables  into  the  symmetrical  and  antisymmetrical  equations  obtained  from 
the  oripnal  BTE,  and  discretize  using  finite  differences.  We  next  analytically  integrate 
the  rapidly  varying  part  of  the  difference  equations  between  grid  points.  This  allows  us 
to  account  for  the  exponential  variation  of  the  distribution  function  analytically,  thereby 
alleviating  the  computer  of  much  of  the  burden.  The  discretization  is  then  completed  to 
yield  a  matrix  equation  with  significantly  enhanced  diagonal  terms.  This  discrete  ver¬ 
sion  of  the  BTE  is  then  solved  using  SOR-type  iterations  in  the  real-space  domain  and 
sparse-matrix  Gaussian  elimination  in  energy-space. 

3.  Self-Consistently  Solving  the  Coupled  System 

With  a  robust  method  for  solving  the  BTE  developed,  the  entire  coupled  Boltzmann- 
Poisson-Continuity  nonlinear  system  is  solved  using  a  Gummel-type  iteration  scheme. 
The  Poisson  equation  is  directly  discretized  and  solved  with  spsuse  matrix  algebra.  The 
hole-current-continuity  equation  is  discretized  with  the  standau-d  Scharfetter-Gummel 
approach,  and  solved  directly  with  sparse  matrix  Gaussian  elimination.  To  facilitate 
convergence  of  the  overall  scheme,  special  damping  and  weighting  factors  have  been 
developed  which  help  to  guide  each  Gummel  iteration  toward  the  proper  solution.  The 
flow  chart  of  this  numerical  procedure  is  shown  in  Fig.  1. 

V.  Results 

Simulation  results  of  a  submicron  n* fpfn~  fn*  BJT  are  shown  in  Figs.  2  to  8.  In 
Fig.  2  the  prototype  BJT  structure  is  shown.  Fig.  3  shows  the  distribution  function 
for  the  entire  device.  In  Figs.  4  and  5  we  show  calculated  values  for  average  velocity, 
carrier  concentration,  average  energy  and  electric  Held.  The  figures  also  show  that  good 
agreement  with  MC  calculations,  which  employ  the  same  transport  model,  was  obtained. 
It  is  worth  noting  that  velocity  overshoot,  which  is  characteristic  of  non-equilibrium 
electron  transport,  is  observed  near  the  p/n~  junction.  To  demonstrate  the  robustness 
of  the  algorithm,  in  Figs.  6  to  8  we  show  results  of  average  energy  and  velocity,  as  well 
as  ionization  coefficients,  calculated  for  a  large  range  of  applied  biases. 

VI.  Conclusion 

We  have  developed  an  accurate  and  stable  approach  to  BJT  simulation  by  the  direct, 
self-consistent  solution  to  the  Poisson,  hole-continuity  and  electron- Boltzmann  equations. 
The  method  calculates  the  distribution  function  for  the  entire  device.  Furthermore,  the 
method  uses  less  than  1/100  the  CPU  time  required  by  similar  MC  calculations. 
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la^  Ol  ONliMwn  (unction  Moyulud* 


Fig.  1.  The  flow  chart  illustrates  the 
Gonunel  algorithm  used  in  this  work. 


Fig.  3.  The  energy  distribation  function 
as  a  function  of  position  along  the  BJT. 


Fig.  2.  The  prototype  BJT  modded  by 
direct  solution  of  the  Boltzmann-Poisson' 
Continuity  system. 


Fig.  4.  Dectron  concentration  (dash  line) 
and  avCTage  velocity  (solid  line)  calculated 
by  the  new  method.  Average  velocity  from 
MC  calculations  (open  circles)  is  also  plot¬ 
ted. 


Fig.  5.  Electric  fidd  (dash  line)  and  aver¬ 
age  energy  (soUd  line)  calculated  with  this 
direct  method.  MC  calculations  are  also 
plotted  (open  circles). 
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Fig.  6.  Awragt  dcctron  m* 
ergy  »■  a  fnactioa  of  pocitioo 
along  thn  BJT.  The  simnlatioa* 
easily  converge  for  a  wide  range 
of  appUed  biases  using  the  Schar- 
fetter  Gnmmel-Uke  discretiza¬ 
tion  of  the  Boltzmann- Poisson- 
Con  tinnity  system. 


Fig.  7.  Average  electron  ve¬ 
locity  as  a  function  of  position 
calculated  by  the  Boltzmann- 
Poisson-Continnity  system  for 
a  wide  range  of  applied  biases. 
Velocity  overshoot  is  clearly  pre¬ 
dicted  by  the  method. 


Fig.  8.  Values  for  impact  ion¬ 
ization  coefficient  calculated  tor 
a  wide  range  of  applied  bias 
voltages.  It  is  clear  that  im¬ 
pact  ionizaticm  is  maximum  in 
the  high-fidd  collector  region. 
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Abstract 

A  new  self-omsisteiit  charge-control  model  which  includes  models  for  realistic  trapping 
inocesses  is  described  for  a  high  electron  mobility  transistor  (HEMT).  This  generidised 
HEMT  model  takes  into  account  die  effects  of  deq>  levels  both  in  semi-insulating  GaAs  and 
doped  AlGaAs  layers.  It  is  shown  diat  conventkmal  charge-control  models  are  insufficient 
to  describe  HEMT  operation  near  pinch-off  and  in  the  normally-on  mode.  The  model  is  used 
to  investigate  the  effects  of  Fermi-tevel  pinning  at  the  semi-ingiiating  interface  on  the  device 
performance  and  is  shown  to  lead  to  less  injection  of  electrons  into  substrate. 


I.  Introducdmi 

An  understanding  of  the  trailing  mechanism  in  HEMTs  is  very  important  for  developing 
accurate  models  for  HEMT  characterisation.  Trapping  of  electrons  in  doped  AlGaAs  layer 
and  semi-insulating  (SI)  GaAs  substrate  is  not  only  responsible  for  different  anomalies 
observed  at  cryogenic  terrqieratures  but  limit  the  device  performance  at  300K  as  well. 
Exarrqiles  include  the  colhqise  of  I-V  duuacterisdes,  decrease  in  both  maximum 
transconductatKe  and  in  the  gate  voltage  swing,  shift  in  pinch-off  voltage,  slow  transients  in 
switching,  and  generatkm-recombiruttion  noise.  These  effects  become  nxne  prcmounced  in 
very  short  chatmel  HEMTs  for  millimeter-wave  qiplicadons.  Although  trqipmg  phenomena 
are  very  inqiortant  in  determining  the  diaracteristics  of  HEMTs  and  there  has  been  significant 
amount  of  research  directed  towards  understanding  the  basic  physics  of  deep  levels  in  defied 
AlGaAs  layers[l]-[2]  and  SI  substrate  [3]-[4],  diete  has  been  little  theoretical  effort  devoted 
to  incorporating  bnqiping  medianisms  in  the  device  models.  The  influence  of  Fermi-level 
pinning  at  die  GaAs  substrate  on  HEMT  toleration  has  been  discussed  by  Krantz  et  al  [5] 
using  analytical  eiqffessions.  Shauid  et  al  [6]  have  included  in  their  two-dimensional  model 
die  trappir^  process  associated  with  the  dt^ed  AlGaAs  layer  by  asMiming  that  all  donors  are 
electrically  active  as  DX  centers.  They  demonstrated  that  die  DC  transconductance  can  be 
lowered  as  much  as  60%  due  to  electrtw  trqiped  in  the  doped  AlGaAs  layer.  Japanese  wo^ 
[7]  has  confirmed  diat  DX  centers  are  also  i»esent  in  n-type  GaAs  and  become  the  ground 
state  when  pressure  exceeds  20  K  bar.  This  result  provides  die  strong  evktence  that  DX 
properties  are  associated  widi  isolated  donors.  Ihe  measurement  work  done  at  IBM  Thomas 

J.  Watson  Reaeardi  Center  [2]  has  findier  shown  diat  diere  is  a  strong  variation  of  trapping 
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kinetics  widi  chaoge  in  aUoy  conqxMitkm  of  Al,Gai.,As.  These  results  strmigly  suggMt  diet 
trqipii^  models  should  include  more  than  one  doom'  level. 

In  our  recent  pqper  [8],  we  have  presented  a  more  realistic  diree-kvel  tiding  model  Ah' 
doped  AlGaAs  layer.  In  dits  p^ier  we  {nesod  a  ziew  self-ctHnistem  dmTge-coatroi  model 
which  ipchMies  realistic  trqjpii^  processes  in  both  the  doped  AlGaAs  and  SI  substrate.  Thr 
model  provides  a  natural  explanation  of  die  published  experimental  results  related  to  trapping 
mechanisms  and  demonstrates  the  role  of  deep  level  states  on  the  operatkm  of  single-,  multi¬ 
channel,  inverted  and  pseudomorphic  HEMTs. 


n.  The  Charge-Contnd  Modd 

In  this  section  a  self-consistent  charge-control  model  incorporating  deqi  level  effects  is 
developed  by  modifying  the  Poisson  equation  to  include  trapping  effects.  Based  on  the  results 
of  a  self-consistent  quantum  mechanical  model,  which  shows  that  most  of  die  electron  reside 
in  first  three  quantum  energy  levels,  and  on  the  study  of  Yoshida  [9]  that  a  classical  model 
with  Fermi-stadstics  predicts  the  device  performance  with  good  accuracy,  a  fast  and  accurate 
charge-control  model  is  developed.  The  present  model  is  based  on  self-consistendy  solving 
Poisson  and  SchrSdinger  equations  for  a  maximum  of  iqi  to  three  quantum  energy  levels  and 
applying  a  Haasical  model  with  Fermi-statisdcs  for  the  remaining  electrons.  The  self- 
consistent  quantum  mechanical  model  has  been  described  in  the  literature  [9]-[10]  and  only 
the  broad  principle  will  be  described  here.  The  electrostatic  potential  is  related  to  charge- 
distribution  by  Poisson  equation 

+  pW  *  0  (1) 


where  p  is  the  net  local  charge  density,  and  the  envelope  wavefiinction  for  the  tth  sub-band 
satisfies  the  schrodinger  equation 


d 

f  1 

dx 

^m*  dx  j 

=  0 


(2) 


where  m*  is  the  effective  mass,  is  the  eigen  energy  for  the  fth  sub-band  and  V,  is  the 
potential  including  the  local  exchange  correlation  coefficient.  The  Fermi  level  is  assumed  to 
be  constant  throughout  the  semiconducting  layers  in  equilibrium  and  the  self-consistent 
solution  of  equations  1  and  2  gives  the  accurate  band  profile,  sub-band  energies  and  Fermi 
level. 


A.  The  Substrate  Tn^t  Model 

A  high  density  of  both  donor-  and  acceptor-like  tnqis  exists  in  the  SI  substrate.  It  is  generally 
accepted  that  there  are  iq>  to  4  traps  which  are  important  in  determining  the  properties  of  SI 
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GtAt  [3]-[4]  and  only  dieae  tr^M  are  included  in  die  present  chaixe-coolxol  model.  The  first 
d dieae  is  EL2  level  deq>  donor  which  resides  at  an  energy  levri  O.SeV  bdow  the  conduction 
band  edge  in  concentrations  ranging  between  10*  and  10°  m*’.  The  shallow  donor  and 
aocqnor  trqis  ate  generally  present  in  omcentration  of  about  10°  m'’.  The  shallow  donor  is 
attributed  to  silictm  introduced  fiom  die  walls  of  reactor  vessel  and  shallow  accqum  is 
associated  to  cathtm.  Finally,  a  deqi  accqxor  level  is  fneseat  at  an  enogy  level  about  0.8 
eV  sdiove  the  valence  band  e^.  The  local  net  charge  deady  in  SI  substrate  is  givoi  by 

ndiere  is  the  doping  density,  Non>  ^fon  ate  die  net  densities  of  positive  tr^iped  charge  in 
die  shallow  and  deep  donw  trqis  and  N^rd  are  the  ntt  densities  of  negative  tn^iped 
charge  in  shallow  and  deqi  accqitors  tnqis  reflectively. 


B.  Ike  Ik^ed  Quuuul  TNp  Model 


The  doped  AlGaAs  layer  introduces 
several  undesirable  effects  for 
HEMT  operation.  They  ate  mainly 
related  to  large  number  of  deqi 
stales  which  become  dominant  for 
A1  mole  fractions  2  0.22.  The 
doped  layer  trqiping  mechanism  is 
incorporated  in  die  present  charge- 
control  model  based  on  sinqile  three 
donor-level  model  which  is 
described  in  detail  in  our  recent 
pqier  [8].  However,  as  we  will 
refer  to  it  in  this  pqier,  a  brief 
description  of  it  is  included  here  for 
the  sake  of  cooqileteness.  The 
doped  donor  atoms  can  occupy 
either  of  the  three  donor  levels, 
shallow,  deqi,  or  DX.  The  total 
dtmor  doping  density  in  our  model 


Figure  1.  Omduction  band  energy  of  a  single  channel 
HEMT  along  die  transverse  axis  under  the  gate. 


is  given  by  the  sum  of  the  densities  of  die  three  donor  levels.  This  model  is  consistent  with 
die  models  treated  dieoretically  by  Morgan  [1]  and  also  accounts  for  experimental  results 
obtained  1^  Motmey  q  al  [2].  The  local  net  char;^  density  in  AlGaAs  layer  is  given  by 

(4) 


v^iere  are  reflectively  die  net  densities  of  electrons  trqiped  in  die  diallow, 

deqi  and  DX  levels. 


C.  The  Tn^  FfBing  Factor 


la  HEMTs,  Ac  deaii^  of  mobile 
cmien  dqpcade  oo  die  electron 
tnpfdag  in  die  trq>  levels.  Tbe 
lil^  oi  dieee  trqi  levels  is 
calculated  based  on  a  Shockley- 
Read-HaU  model  [11H12]  for 
recondnaation  dnou^  a  single 
level.  The  donor  tnps  are  positive 
adien  empty  (i.e.  adtenocaqiied  by 
hcdes)  and  neutral  when  filled  (i.e. 
ndien  occiyied  Ity  electrons),  and 
accqitm  trqis  are  negative  when 
filled  and  noitral  when  enqity. 


m.  Results  and  Disnissions 


TnrawaiM  Am  Undar  Tha  Gaia  (^) 


Figure  2.  Electroo  density  of  a  single  channel  HEMT 
Figures  1  and  2  reflectively  show  akmg  the  transverse  axis  under  die  gate, 
the  single-channel  HEMT’s 


conduction  band  energy  and  electron  density  along  the  transverse  axis  die  gate,  with 
and  without  the  SI  substrate  trfiping  effect.  Tbe  siinulatkm  results  have  shown  dial  trfiping 
in  die  SI  substrate  have  a  significant  effect  near  pinch-off.  The  pinning  of  die  Fbrmi  level  at 
die  SI  interface  makes  the  conduction  band  nune  ataqit  and  an  interfile  dqdetxm  region  is 
fcnmed.  This  results  in  decreased  election  ^knsity  near  die  SI  due  to  narrowing  of 

the  channel.  At  tbe  same  tune  diis 


leads  to  greater  confinement  of 
channel  electrons  and  less  substrate 
electron  injection.  Due  to  improved 
carrier  confinement  with  die  SI 
substrate,  the  device  has  good 
pinch-off  characteristics. 

The  simulation  results  have  shown 
diat  defied  layer  trqiping  effects 
become  more  infxntant  as  tbe  gate- 
source  voltage  is  made  more 
positive.  This  is  due  to  die  increase 
in  parallel  current  in  doped  AlGaAs 
layer  and  saturation  of  two- 


CendueHon  Band  Enaray  (aV) 


dimensional  electrmi  gas  which  no  Trantv«MAmUndarThaQm(^) 

longer  refionds  to  variation  in  gate 

voltages.  This  demonstrates  die  Figure  3.  Conduction  band  energy  and  electron  density 
decrease  in  tians-  of  •  two  channel  HEMT  along  die  transverse  axis 

conductance  and  collfise  of  IV  under  die  pte. 


characteristics  at  high  gate 

voltt^.  Figure  3  shows  the  charge-control  results  of  a  two-channel  AlGaAs/GaAs  HEMT 
and  demonstrates  diat  doped  layer  tnqfiiiig  effect  is  dmninant  for  all  die  gate-biases.  In  multi- 
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dnand  HEMTt,  tlie  effect  topping  is  mote  pnmounced  in  die  aecood  doped  layer  becttue 
parallel  current  in  H  flows  for  afe^  all  fee  gale-bias  v<fei^.  In  pseudomorphic  HEMTs, 
fee  effect  (rf  fee  SI  substrate  is  less  pronounced. 


IV.  Coariiisloos 

We  have  developed  a  new  self-consistent  charge-control  model  for  HEMTs  feat  includes 
trqiping  effects  in  fee  SI  GaAs  substrate  and  doped  AlGaAs  layer  and  i»ovides  a  phjrsical 
undostanding  of  short  channel  HEMT  operation.  It  is  oqiected  feat  the  present  diarge-control 
model  will  be  useful  in  developing  mme  realistic  and  accurate  models  for  HEMTs. 
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Abatmct 

The  validity  of  energy-dependent  relaxation  timet  often  used  in  a  GaAt  hydrod3mamic 
modd  hat  been  carefnlly  examined  udng  the  telf-conditent  Monte  Carlo  simulation.  We 
have  found  that  those  transport  coeffidents  assodated  with  the  intervalley  transfer  from 
the  lower  to  the  upper  valley  are  not  single-valued  functions  of  the  averaged  electron  en¬ 
ergy  in  the  valley.  If,  instead,  the  valley  population  ratio  or  the  average  energy  weighted  by 
the  valley  population  ratio  is  used  a  substantial  improvement  in  accuracy  can  be  achieved. 

I.  Introduction 

Conventionally,  macroscopic  (averaged)  relaxation  timet  such  as  r^,  r,  and  rw  appearing 
in  the  hydrodynamic  (HD)  transport  equations  ate  assumed  energy-dependent  and  deter¬ 
mined  by  performing  Monte  Carlo  (MC)  calculations  under  steady  state  and  homogeneous 
field  conditions.  These  expressions  are  often  extended  to  the  case  of  inhomogeneous  fields 
without  any  justification.  Sandbom  et  al.  [1],  using  the  MC  simulation,  found  that  both 
the  energy  and  momentum  relaxation  times  in  an  equivalent  single- valley  model  under  the 
transient  condition  differ  very  much  from  the  steady  state  and  homogeneous  field  values. 
Yamada  [2]  also  observed  the  discrepancy  in  the  relaxation  times  between  the  homoge¬ 
neous  and  inhomogeneous  field  conditions.  He  stiggested  that  the  relaxation  times  should 
depend  not  only  on  the  energy  but  also  on  the  valley  population.  In  this  study,  instead  of 
a  single-valley  model,  a  three-valley  HD  modd  for  GaAs  has  been  devdoped.  The  tiaus- 
port  coefildents  appearing  in  the  HD  modd  for  homogeneous  and  inhomogeneos  fidd 
conditions  are  evaluated  by  a  single  partide  MC  simulation  program  and  a  multi-partide 
self-consistent  MC  simulation  program,  respectivdy.  A  one  dimensional  N*  —  N  —  N* 
GaAs  ballistic  diode  was  used  as  a  test  device.  This  approach  allows  us  to  ligoroudy 
examme  the  validity  of  the  energy  dependence  of  each  relocation  time  as  well  as  provides 
us  valuable  information  for  a  more  appropriate  description  of  the  relaxation  times. 


n.  Moments  of  the  Boltsmnnn  Trsmsport  Equation 


The  HD  transport  equations  can  be  obtained  by  taking  various  moments  of  the  Boltsmann 
transport  equation  (BTE)  [3].  Extending  the  work  of  [4],  [5]  to  a  multi- valley  system,  we 
obtain  the  following  steady-state  conservation  equations  for  the  tth  valley: 


V.(ml?0  =  [-—-  — +  —  +  —], 

TnH 
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At  liij 

fit  ^Wu  Aj  ftji  Ai  T'liii 

+  (4) 

Ai  Mm  MWi  M<tt 

where  Vi  —  (vi),Ui  —  {vihki),Wi  =  {ei),Si  =  (^e<)  And  ^  =  {eivihki).  In  the  conven¬ 
tional  HD  transport  model,  the  transport  coefficients  (rn,M>fW,etc.)  are  usually  assumed 
to  depend  on  the  average  energy, This  approach  ignores  the  dependence  of  the 
transport  coefficients  on  the  shape  of  the  distribution  function. 


m.  Self-Consistent  Monte  Carlo  Simulation 

To  examine  the  accuracy  of  energy-dependent  transport  coefficients,  we  begin  with  a 
tigous  solution  of  the  steady-state  BTE  by  the  MC  method.  The  simulation  program 
uses  an  analytical  multi-valley,  non-parabolic  band.  The  following  t3rpes  of  scattering  ate 
taken  into  account;  acoustic  phonon  scattering,  optical  phonon  scattering,  polar  optical 
phonon  scattering,  ionised  impurity  scattering,  equivalent  and  nonequivalent  intervalley 
scattering.  The  various  scattering  parameters  are  to  those  used  in  [6].  In  this 

work,  a  one-dimensional  N*  —  N  ~  N'*'  GaAs  structure  with  a  0.5  fim  N-region  was  ex¬ 
amined.  The  doping  densities  of  the  three  layers  were  N4  =  I  x  lO^^cm"*,  1  x  10^*cm“* 
and  1  X  10^^em~’,  respectively.  The  applied  bias  was  2.0  volts.  Fig.  1  displays  the  doping 
density  and  electric  field  profiles  within  the  device  as  obtained  firom  the  self-consistent 
MC  (SCMC)  simulation.  Fig.  2  shows  the  F—  valley  velocity  an'^  energy  profiles. 


rV.  Results  and  Discussions 

At  each  position  within  the  device  we  evaluated  the  transport  coefficients  and  the  aver¬ 
age  energy  in  each  valley  by  the  SCMC  program.  We  found  that  these  coefficients  axe 
generally  a  function  of  the  local  average  energy  in  the  valley  except  for  those  due  to  the 
intervalley  transfer  from  the  lower  to  the  upper  valley  (i.e.  F  — »  L,  F  — »  X,  and  L  — »  A'.). 
Figs.3,5  and  7  respectively  display  the  F  — »  X  intervalley  transfer  coefficients,  TnrL,f*ri 
and  Turn  versus  the  average  F— valley  energy,  Wr,  for  both  the  homogeneous  and  the  in¬ 
homogeneous  field  calculation.  The  "hysteresis”  loops  clearly  indicate  that  none  of  them 
can  be  described  as  a  single-valued  function  of  Wr-  For  these  cases,  the  energy-dependent 
transport  coefficients  which  were  obtained  from  the  homogeneous  field  calculation  always 
underestimate  the  actual  one  in  the  increasing  field  region  and  overestimate  the  same  in 
the  decreasing  field  region.  If,  instead,  Wr  is  weighted  by  the  valley  population  ratio  (i.e., 
^Wr)  then  the  hysterisis  loop  for  r„r£  can  be  significantly  reduced  (see  Fig.  4).  This  is 
motivated  by  the  fact  that  the  valley  population  ratio  more  or  less  reflects  the  fraction  of 
electron  population  which  have  sufficient  energy  to  transfer  from  the  lower  to  the  upper 
valley.  We  also  fotuid  that  using  the  valley  population  ratio  alone  (i.e.,  the  hysterisis 
loop  can  be  considerably  reduced  for  /tri  utd  m  seen  in  Figs.6  and  8,respectively. 


The  Ksult  for  to'L  i>  very  limilax  to  twtl  uid  that  for  i*  nmilar  to  itri-  Once  the 
hysteriais  loop  is  reduced,  these  transport  coefficients  can  now  be  modelled  empirically  as 
single-valued  functions  of  the  valley  population  ratio  or  the  energy  weighted  by  the  valley 
population  ratio. 


V.  Conclusions 

A  SCMC  simulation  program  was  used  to  examine  the  conventional  assumption  of  energy- 
dependent  transport  coefficients  in  a  multi-valley  system.  We  found  that  the  transport 
coefficients  related  to  the  intervalley  transfer  from  the  lower  to  the  upper  valley  (i.e., 
T  —*  L,  T  —*  X  and  L  —*  X)  s^e  not  a  single-valued  function  of  the  average  energy 
in  the  valley.  A  substantial  improvement  in  the  accuracy  can  be  achieved  if  the  valley 
population  ratio  or  the  average  energy  weighted  by  the  valley  population  ratio  is  used. 
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Abstract 

The  three-dimensional  (3D)  depletion  approximation  is  computed  numerically  by  solution 
of  a  modified  vo'sion  of  Poisson’s  equation  for  the  electrostatic  potential.  Nonlinearities 
aire  resolved  by  a  globally  convergent  modification  of  Newton’s  method.  The  linear  systems 
are  solved  by  3D  multigrid  routines. 

I.  Introduction 

Computational  algorithms  to  resolve  the  nonlinearities  in  numerical  drift-diffusion  model¬ 
ing  such  as  Newton’s  or  Gummel’s  method  generally  require  a  reasonably  accurate  initial 
guess.  The  charge  neutral  approximation  provides  one  possible  option  that  is  obtained 
with  little  computational  effort.  An  alternative  approximate  solution  is  provided  by  the 
depletion  approximation,  which  tends  to  be  presented  in  the  context  of  analytical  one¬ 
dimensional  modeling.  In  this  paper  we  present  numerical  results  obtained  with  a  novel 
computational  multi-dimensional  implementation  of  the  depletion  approximation. 

If  q  is  the  size  of  the  electron  charge,  T  the  ambient  temperature,  and  kg  Boltzmann’s 
constant,  then  the  thermal  potential  Ut  =  {kBT)lq.  The  intrinsic  Debije  length  A©,,  is 
then  equal  to  Xo.i  =  y/{^fokBT)l{q^ni).  We  express  length  in  terms  of  an  arbitrary  unit 
Iq.  The  nonlinear  Poisson  equation,  in  terms  of  the  electrostatic  potential  u  (in  units  of 
the  thermal  Ut)  and  the  quasi-Fermi  levels  v  and  w,  is  then  given  by 

-V’u-|-(-r^)*[e’‘-’'-e'"-*-ifei]  =  0.  (1) 

Ad,< 

In  thermal  equilibrium  the  quasi-Fermi  levels  v  and  w  may  be  set  equal  to  the  constant 
value  0.  In  the  chaurge-neutral  aqxproximation  the  potenti2d  function  u  is  set  equal  to 

Uc»  =  sinh“^(ki/2).  (2) 

However,  it  is  obvious  that  the  charge  neutral  approximation  Ucn  is  significantly  less 
accurate  as  an  estimate  to  the  solution  u  to  Eq.  (1)  if  v  and  tj;  vary  significantly,  reflecting 
applied  bias  potentials.  The  charge-neutral  approximation  vnll  be  particularly  inaccurate 
in  modeling  the  depletion  around  reverse  biased  p-n  junctions. 

Our  algorithm  to  compute  the  multi-dimensional  depletion  approximation  employs  a  mod¬ 
ification  of  the  nonlinear  Poisson  equation  (1 )  in  which  we  attempt  to  reduce  mobile  charge 
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deositMs  to  0  where  the  solution  is  furthest  fr(»n  thermodynamic  equilibrium.  The  mod¬ 
ified  Poisson  equation  is  givm  by 

-  -I-  ^ j  +»  -ki]  =  0  (3) 

where  U4tpt  i>  the  potential  distribution  which  accounts  for  all  depletion,  and  w^e 
are  the  piece-wise  constant  quasi-Fermi  levels,  uu  is  the  thermal  equilibrium  potential 
distribution,  and  vu  is  the  quasi-Fermi  level  in  thermal  equilibrium  (whidi  is  ?ero).  The 
nonlinearities  are  resolved  with  Newton’s  method  and  the  linear  systems  are  solved  with 
three-dimensional  multigrid  routines  developed  by  the  second  author  [1].  Special  care 
was  taken  to  keep  the  linear  systems  well-conditioned.  First,  we  solve  E^q.  (1)  without 
externally  applied  bias  for  ut,.  Then  the  extnnally  iq>plied  biases  are  iq)plied  at  the 
cmitacts.  The  quasi- Fermi  levels  are  adjusted  in  a  piece- wise  constant  nutnner  in  each 
dt^ing  region  to  match  the  applied  bias  for  the  contact  on  that  region.  Then  Eq.  (3)  is 
solved  for  the  complete  depletion. 

IL  Numerical  Software 

The  linear  multilevel  method  at  the  core  of  the  software,  as  described  in  [1,  2],  allows 
for  discontinuous  coefficients  as  occur  in  material  interface  problems.  Operator-induced 
prolongation  procedures  are  iised  to  enforce  flux  conservation  at  box  boundaries  when 
a  mesh  function  is  interpolated  from  a  coarse  to  a  fine  mesh.  Smoothing  operators  are 
Red/Black  Gauss-Seidel  and  weighted  Jacobi,  whereas  the  coarse  mesh  problem  is  solved 
with  conjugate  gradient  methods.  Nonlinear  problems  are  handled  with  an  extremely 
robust  globally  convergent  damped-inexact-Newton-znultilevel  solver  based  on  fast  linear 
multilevel  methods  for  the  inexact  Jacobian  system  solves.  A  nonlinear  prolongation 
operator  has  been  employed  for  nested  Newton  iteration.  The  discretization  is  with  the 
box  method. 

To  handle  the  severe  numerical  problems  occurring  with  nonlinearities  of  exponential-type 
present  in  the  semiconductor  equations,  we  developed  argument-capping  functions  which 
avoid  nonvectorizable  statements.  Calls  to  the  standard  intrinsic  functions  are  replaced 
by  these  modified  functions,  and  overflows  are  successfully  avoided  during  early  transient 
iterations  without  loosing  the  execution  efficiency  of  the  intrinsic  functions.  Due  to  the 
various  choices  made  during  the  development  of  this  package,  the  software  executes  at 
very  high  rates  on  a  number  of  modem  computers;  see  for  example  [1,  2]  for  benchmarks. 

in.  Numerical  Results 

In  the  following  pages,  we  show  the  initial  charge  neutral.  Fig.  1,  the  thermal  equilibrium, 
Fig.  2,  and  the  final  depleted.  Fig.  3,  electron  depletion  for  a  three-dimensional  BJT 
model.  The  computational  grid  was  49  x  49  x  25  yielding  a  total  of  60, 025  unknowns. 
The  computation  of  the  our  depletion  model  took  48  seconds  on  a  HP  735  workstation, 
demonstrating  the  considerable  computational  effidmcy  of  the  multigrid  routines. 
Figures  1,  2,  and  3  show  two  cut  planes,  one  on  the  top  on  the  of  the  device  and  (me 
down  the  middle.  The  emitter  is  near  the  center,  but  slightly  to  the  left,  the  base  (x>ntact 
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Table  1:  Timing  and  Iteration  Statistics 


Grid  Size 

Number 
of  Points 

Solution 

Time 

Newton  Iterations 
Thermal  Equil. 

Newton  Iterations 
Applied  Bias 

25  X  25  X  25 

15.02 

5 

6 

25  X  25  X  33 

20,625 

20.04 

5 

6 

33  x  33  x  33 

35,937 

40.16 

5 

7 

49  X  49  X  33 

79,233 

103.07 

6 

7 

49  X  49  X  49 

117,649 

152.23 

6 

7 

is  to  the  right  of  the  emitter,  and  the  collector  contact  is  near  the  right  edge.  The  lighter 
areas  of  the  planes,  representing  lower  electron  densities  (<  5  x  10**cm~’),  are  the  parts 
of  the  base  and  collector  which  have  been  depleted  of  electrons. 

The  device  simulated  is  a  BJT  built  in  an  area  of  1  fim  x  1  /xm  which  is  0.5  ftm  deep.  The 
device  has  a  realistic  doping  profile  [3,  4].  The  badcground  n-doping  of  10^^  cm“®  forms 
the  collector.  The  base  has  a  0.46  /xm  x  0.50  /xm  surface  area  and  a  depth  of  0.15  fim  with 
a  p-t)rpe  doping  of  10^*  cm The  n-type  emitter,  which  is  doped  at  10“  cm"®,  has  a 
0.08 /xm  X  0.13 /xm  area  and  a  depth  of  0.04 /xm.  This  base  width  corresponds  closely 
to  [5]. 

The  Debije  length  is  given  by  [6],  Lp  =  ),  where  Nb  is  the  doping 

concentration  on  the  more  lightly  doped  side  of  an  abrupt  junction.  The  Debije  length  of 
the  emitter-base  junction  in  this  device  is  4.1  nm  and  13  nm  for  the  collector-base  junction. 
The  depletion  width  of  a  junction  is  given  by  {6)  W  =  LD^‘2[9{yu  —  Vafi)/{kBT)  —  2), 
where  Vu  is  the  builtin  voltage  and  V^pi  is  the  applied  bias.  The  emitter-base  junction 
has  a  depletion  width  of  0.036 /xm.  The  collector-base  junction  is  0.10/xm  when  no  bias 
is  applied  and  0.22 /xm  when  Vfe  =  Vi  =  OV  and  Vc  “  3.0V. 

The  metallurgicai  junction  width  from  the  bottom  of  the  emitter  to  the  bottom  of  the 
base  region  is  0.103 /xm.  The  calculated  electron  depletion  region  (the  base  width  plus 
the  collector-base  depletion,  which  is  assumed  to  take  place  entirely  in  the  collector)  i£i 
0.208  /xm  for  the  zero  bias  case  while  it  is  0.327  /xm  for  the  biased  case.  Using  a  threshold 
of  a  factor  of  two  less  than  the  doped  concentration  as  the  edge  of  the  depletion  region, 
the  results  from  the  thermal  equilibrium  solution  gave  an  electron  depletion  region  of 
0.195 /xm  while  the  depletion  calculation  gave  0.303 /^m.  These  compare  favorably  to 
the  analytically  calculated  results;  however,  agreement  will  not  be  exact  due  to  three 
dimensional  effects  and  the  error  in  assuming  all  of  the  depletion  occurs  on  the  more 
lightly  doped  side  of  the  junction. 

The  width  of  the  undepleted  base  decreased  from  0.103 /xm  in  the  charge  neutrsd  case 
to  0.072 /xm  at  thermal  equilibrium.  When  the  final  depletion  correction  including  the 
bias  was  added,  the  base  width  was  0.059  /xm.  These  results  are  similar  to  experimental 
measurements  [7]. 

The  comparison  of  the  charge-neutral  approximation  tXen  and  the  multi-dimensional  de¬ 
pletion  approximation  demonstrates  clearly  the  superiority  of  the  computational  de¬ 
pletion  iq>proximation.  The  strong  rounding  of  the  electron  depletion  region  in  our  device 
with  an  exactly  rectangular  doping  profile,  demonstrates  the  need  for  a  3-D  calculation. 
Table  1  reflects  the  linear  computaticmal  complexity  of  the  calculation. 
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Figure  1:  Charge  neutral  solution.  The  light  area  of  both  planes  is  the  ba.se  region  while 

the  exterior  dark  region  is  the  collector  and  the  interior  dark  region  is  the  emitter.  This  is 

the  same  orientation  as  the  above  figure:  looking  up  from  slightly  below  the  device.  The 

contacts  are  on  the  top  plane. 
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Abstract 

Electronic  states  inside  heterostmctnre  derices  are  obtained  hj  solring 
Poisson's  and  Schroedinger's  equation  self-consistently.  Schroedinger's 
equatiiHi  is  efficiently  solred  by  using  rariaticmal  techniques  to  obtain 
the  ware  functions  in  teras  of  a  nuaber  of  expansion  functions.  The 
present  aethod  is  used  to  characterise  the  operation  of  single-well  and 
pseudoaorphic  heterostructures,  to  characterise  quantua  wire  dericesi  to 
calculate  the  two-diaensional  scattering  rates,  and  in  Nonte-Carlo  codes. 


I  Introduction 

The  physical  phenoaena  which  are  taking  place  in  ultra-saall  and  highly 
doped  heterostmctnre  derices  require  accurate  physical  aodels  to 
characterise  derice  operaticm  and  to  optiaise  the  stractures.  The  aain 
difficulty  is  that  the  beharior  in  the  two  diaensional  electron  gas  (2^6) 
region  depemis  in  a  coaplicated  aanner  on  the  derice  geoaetry  and  on  bias 
conditions.  The  self-consistent  solution  of  Poisson's  and  Schroedinger's 
equations  is  one  of  the  aost  accurate  aodels  which  is  used  to  characterise 
heterostracture  seaiconductors  to  orercoae  these  difficulties  [1,2]. 

Prerious  self-consistent  calcalations  generally  used  the  finite  difference 
technique  to  solre  Schroedinger's  equation  [3-5].  The  aesh  sise  and  the 
discretisation  aethod  deteriorated  the  accuracy  of  the  obtained  results. 
We  hare  recently  introduced  an  efficient  aethod  to  solre  Schroedinger's 
equation  by  using  rariational  techniques  [6].  In  the  present  work,  we 
apply  this  aethod  to  characterise  different  heterostracture  derices. 


II  Modeling 

The  effectire  aass,  one-diaensional  Schroedinger  equation  is  giren  by 
-  (h*/  2  a* )  (d^V'  (x)/d  x)  ♦  V(x)  V'  (*)  =  K  r  (x)  (1) 

t  III 

where  V(x)  aeans  potential  energy,  E  |  i  genenergy,  V'  (x)  wave  function 
corresponding  to  the  eigenenergy  E  ,  a  effectire  aass,  and  h  Planck’s 
constant.  For  a  seaiconductor  structure  of  width  a,  the  ware  functions 
can  be  expanded  as 
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n  n  X 


) 


(2) 


If  N  ia  infinite,  the  obtained  wave  functions  are  identical  with  the  true 

ones.  However,  a  finite  N  still  leads  to  very  good  accuracy.  The 

coefficients  a  ,  ,  the  eigenenergiea,  and  the  corresponding  wave  functions 
nk 

are  detemined  by  solving  the  natrix  equation  resulting  froa  the 
variational  integral  for  B  [6]. 


Ill  ^plication 


The  previous  sethod  is  applied  to  characterise  a  nodulation  doped 
structures  by  solving  Poisson’s  and  Schroedinger’s  equations  self- 
consistently.  The  2DEG  density,  the  energy  levels,  the  wave  functions,  and 
the  transconductance  are  displayed  versus  the  total  gate  voltage  in  figs. 
1,2,3,  and  4  respectively.  The  total  gate  voltage  represents  the  sun  of 
the  applied  external  voltage  and  the  Schottky  barrier  equivalent  voltage. 
Good  qualitative  agreenent  between  the  calculated  results  and  those  froa 
both  experinents  and  Nonte-Carlo  calculations  [7]  is  obtained. 
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Quantua  wire  structures,  shown  in  fig.S,  csn  also  be  characterised  by 
solving  Schroedinger’s  equation  in  two  dinensions  using  Bayleigh-Rits 
■ethod  to  obtain  the  two-diaensional  wave  functions  (fig. 6).  The  present 
■ethod  has  the  advantageous  that  it  requires  resonable  CPU  tine,  it  is 
straight  forward,  and  it  overcoaes  the  linitations  of  previous  nodels 
arising  froa  nesh  sise  and  discretisation. 


tiS.C  rirat  tno  anva  fonctiona  tor  tiM  atroctiira  in  fig.S  uaiag  10  axpan- 
aion  functiona  (figSa)  and  20  oapanaion  fanctioM  ifig.Ob)  in  anoli  diraction. 


ly  Calculation  of  the  two-diaensional  scattering  rates 


The  two-diaensional  scattering  rates  are  calculated  by  defining  the  natrix 
eleaent  for  scattering  between  the  ith  and  the  jth  subbands  according  to 

I  l"  =  J  I  »<«•<»  l'  I  i./H  I'  *>  <31 

where  Q,  q  are  the  phonon  wave-vector  coaponents  parallel  and  noraal  to 

the  hetero- interface,  and  I  (q)  neans  overlap  integral 

«■  J 

I  (q)  =  f  V'  (x)  V  (x)  expCiqx)  dx  .  (4) 

ij  J  i  j 

y'(x)  is  the  nomalised  envelope  wavef unction.  Both  Bayleigh-Bits  and 
finite  difference  aethods  are  iqiplied  to  calculate  the  subband  energies 
and  the  corresponding  wavef unctions  of  an  AlGaAs/GaAs  heterostructure. 
Using  Sayleigh-Sits  aethod,  the  required  CPU  tiae  (fig. 7b)  to  calculate 
the  2D  scattering  rates  (fig.S)  versus  the  nuaber  of  aubbands  is  nearly 
constant  while  it  greatly  changes  using  the  finite  difference  aethod 
(fig. 7a).  This  aakes  the  application  of  the  present  aethod  aore  practical 
in  particular  for  device  siaulation. 
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V  Noote-Carlo  siaulation  of  Hetero-FETs 

A  two-diaensional  Noote-Carlo  code  is  investigated  to  similate  the 
hetero-FBT  structure  shown  in  fig. 9.  Poisson's  and  Schroedinger's 
equations  are  sovled  self  consistently  along  the  device  and  the  obtained 
results  are  used  to  express  the  conduction  band  by  a  nuaber  of  step 
functions.  The  wave  functions  and  equivalent  energy  wells  at  different 
sections  along  the  channel  are  displayed  in  fig.  10. 
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The  presant  aodal  takes  the  sise  qaaotisatioo  into  account  because  the 
electrons  which  hare  mergies  less  than  the  barrier  height  can  only  nove 
in  two  dinensions.  Moreover,  the  transfer  between  the  different  regions  is 
sinpler  than  in  other  nodels  because  no  extra  scattering  rates  are 
required  to  be  derived.  Our  aodel  leads  to  a  higher  carrier  concentration 
in  the  channel  region  than  the  classical  nodels,  fig.  11,  because  the 
carrier  capture  in  the  2DEG  region  is  better  sinulated. 

VI  CcHiclosions 

An  efficient  co^Nitational  tool  is  used  to  detemine  the  electronic  states 
inside  heterostructure.  The  present  nodel  overcoaes  the  linitations  of  the 
previous  nodels  which  arise  froa  nesh  size  and  discretization.  The  closed 
foms  of  the  wave  functions  nakes  the  calculations  of  the  two-dinensional 
scattering  rates  easier.  This  nethod  can  also  be  efficiently  used  together 
with  Monte-Carlo  codes  to  characterize  seaiconductor  devices. 
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I.  Introduction 

We  present  a  new  technique  for  solving  the  Hydrodynamic(HD)  equations  in  submi¬ 
cron  device  simulations.  This  method  is  extremely  stable,  quickly  converges  even  with 
poor  initial  guesses,  and  agrees  with  Monte  Carlo(MC)  calculations.  In  2uldition,  this 
new  method  is  easy  to  code  for  2-D  device  simulation.  We  have  applied  the  new  method 
to  simulate  2-D  MOSFETs  as  well  as  SOI  devices. 

To  our  knowledge,  no  agreed-upon  HD  model  has  emerged  as  the  basis  for  a  stan¬ 
dard  device  simulator.  The  lack  of  an  industry-standard  CAD  tool  can  be  attributed 
to  the  difficulties  in  obtaining  an  accurate,  numerically  stable,  and  rapidly  convergent 
solution  to  the  HD  equations.  We  have  developed  a  new  robust  algorithm  for  HD  device 
simulation  that  overcomes  exi^r^^ing  difficulties.  We  formulate  the  HD  equations  into  self- 
adjoint  forms  with  a  new  set  oi  Slotboom-like  state  variables.  The  discretizations  result 
in  a  diagonally  dominant  coefficient  matrix  for  each  HD  equation.  Consequently,  the 
convergence  of  each  equation  is  guaranteed  for  any  initial  guess  when  iterative  solution 
methods  are  employed.  Our  discretization  technique  resolves  the  rapid  spatial  variations 
which  may  occur  in  carrier  densities  and  carrier  temperatures.  As  a  result,  stability  of 
the  HD  solution  is  improved.  Furthermore,  a  fixed-point  iterative  method  is  employed 
to  determine  the  solution  of  each  discretized  HD  equation.  A  direct  solution  of  a  matrix 
equation  is  therefore  avoided.  In  addition,  the  method  requires  little  memory,  and  is 
well-suited  for  parallel  computations. 

II.  The  HD  Model 

The  HD  equations  are  obtained  from  a  standard  HD  formulation  [1]. 


-(n-p-D) 

(1) 

-VJn  =  R 

9 

(2) 

^  Sn  ^  Jn  E  n  /rp  \ 

•ntsfyJ-n) 

(3) 

qUnTl  V  <^  +  qD„  V  «  +  Pn*Bn  V  Tn 

(4) 

s",  =  +  (?; 

-q  -q 

(5) 
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2,  ^  1.2 

Wn  =  -^^eTn  +  2”*nVdn 

Qn  =  -Ken  V  7’,  Ken=  2Tn{kB I qf{nqHn)  (7) 

Here,  4>  is  the  electric  potential;  n  and  p  are  the  electron  and  hole  densities 
respectively;  D  is  the  net  doping  concentration;  Jn  is  the  electron  current 
density;  R  denotes  the  recombination  process;  is  the  average  electron  en¬ 
ergy  and  5„  is  the  electron  energy  flux;  E  is  the  electric  field;  Tnu  is  the 
energy  relaxation  time  for  electrons;  is  the  average  carrier  energy  in  ther¬ 
mal  equilibrium;  pn  is  the  electron-temperature-dependent  mobility;  D„  is 
electron  temperature-dependent  diffusivity;  r„  is  the  electron  temperature; 
m*  is  electron  effective  mass;  vj„  is  the  electron  mean  velocity;  Q  is  the  heat 
flux.  (We  note  that,  in  an  effort  to  be  concise,  only  the  HD  equations  for 
electrons  were  shown.  However,  our  calculations  include  the  self-consistent 
solution  of  the  hole  and  electron  HD  equations.) 

The  typical  approach  at  this  point  is  to  solve  the  above  system  of  equations  by  using 
n,Tn  as  the  unknown  state  variables.  However,  the  current- continuity  and  the  energy- 
balance  equations  under  this  approach  may  give  rise  to  numerical  difficulties  such  as 
stability  problems  and  spurious  spikes  in  average  electron  velocity.  To  overcome  these 
numerical  proble-ois,  we  take  another  approach  by  first  defining  a  new  set  of  Slotboom-like 
state  variables.  Then,  we  transform  the  HD  equations  into  self-adjoint  forms  with  these 
new  variables.  A  new  Scharfetter-Gummel-like  discretization  scheme  is  then  employed  to 
resolve  the  rapid  variations  in  n  and  T„.  The  resulting  matrix  equations  are  diagonally 
dominant  and  exhibit  excellent  numerical  properties. 

III.  The  New  Method  for  Solving  HD  Equations 


A.  Slotboom-Like  Variables  for  the  HD  Model 

We  define  a  new  set  of  Slotboom-like  state  variables  u  and  ff„  for  electron  density  and 
electron  temperature: 


„  =  exp  =  „.„exp  (8) 

T„  =  Pnexpid^n/OT),  or  =  8  =  0.0207V  (10) 

\PnnkB  5  q 

where  V’n  is  the  electron  quasi-Fermi  potential  and  Ti  is  the  lattice  temperature. 


B.  The  Self-Adjoint  form  of  the  HD  model 

Substituting  eqns.  (8)(9)  into  (4),  and  eqn.  (10)  into  (5)  respectively,  the  electron 
current  density  and  electron  energy  flux  in  terms  of  the  Slotboom-like  variables  u  and  jr„ 
are  as  follows: 
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(11) 


r  r»  {'^”'  —  TL  q<i>\ 

=  +  — jv». 

Sn  =  -«e«exp(0„/aT)  V5n  +  (12) 

-q  I 

Eqn.  (11)  can  be  further  simplified  to  J„  =  —q/in^  V  0n-  The  steady  state  HD  model 
can  now  be  described  by  a  system  of  self-adjoint  equations.  This  system  can  be  obtained 
by  appropriately  substituting  eqns.  (11)-(12)  into  eqns.  (l)-(3). 


■?(■ 


fTn  —  Ti  q<i>  ,  Tp  —  Ti  q<i>  ^ 

exp( - - - -I-  T^)  -  uexp( — ^ - t-tt) 


( r\  ,  Tn-Tl  ,  q4> 

(^0,n,exp(-^^  +  _)V 


uj  =  R{i^,u,v) 


V  •  («cnexp(V>„/aT)  Vffn)  = 


It  is  clear  from  the  above  expressions  that  the  Poisson,  the  current-continuity  and  the 
energy-balance  equations  are  each  self-adjoint  differential  equations  with  respect  to  the 
variables  <i>,u  and  It  is  also  interesting  to  note  that  when  Tn  is  equal  to  Tt,  the 
above  new  expressions  for  the  Poisson  and  current  continuity  equations  reduce  to  the 
DD  model[2]. 

C.  Discretization  Scheme  and  Iterative  Method  for  HD  Equations 

A  Scharfetter-Gummel-like  method  is  employed  to  discretize  the  current-continuity 
equations  and  the  energy- balance  equations.  The  discretizations  result  in  a  diagonally 
dominant  coefficient  matrix  for  each  HD  equation.  A  fixed-point  method  is  applied  to 
solve  the  system  of  discretized  HD  equations.  Due  to  the  property  of  diagonal  dominance, 
the  convergence  for  the  solution  of  each  HD  equation  is  guaranteed  [2,3]. 

rV.  Numerical  Results 

To  test  convergence  of  the  technique,  we  generated  initial  guesses  using  a  random 
function.  With  random  initial  guesses,  the  same  results  were  obtuned  as  when  good 
initial  guesses  were  used.  To  examine  stability,  we  simulated  MOSFET's  on  a  rather 
coarse  grid  of  less  than  400  mesh-points  and  smooth  results  were  obtained.  Additionally, 
we  found  that,  in  general,  our  HD  simulations  required  only  about  75%  more  CPU 
time  than  drift-diffusion  model  simulations.  Fig.  1  shows  the  simulation  results  for  a 
submicron  MOSFET  with  0.5/im  channel  length.  In  Fig.  2,  we  show  the  results  of  a 
computation  performed  for  a  l/itn-channel  SOI  device.  Finally,  to  test  the  accuracy  of 
the  new  HD  model,  we  compared  our  simulations  to  MC  calculations  for  a  square  shape 
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field.  Fig.  3  shows  good  agreeir  nt  between  MC  simulations  and  our  HD  calculations. 
Also  interesting  in  Fig.  3  is  that  the  spurious  second  overshoot  spike,  which  usually 
appears  in  HD  simulations,  is  absent  from  our  calculations. 
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Figure  1 .  2-D  Submirron  MOSFET  Simulation  Results 
(Vgs  =  3V,  Vds  =  3V,  Channel  Length  =  O.o^m) 
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Abstract 

The  high-frequency  re^nse  of  resonant  tunneling  (tevices  (RID),  subjected  to  time-dependent 
signal,  is  conside^  in  both  stable  and  unst^k  situations  ^irfien  operating  in  the  negative 
differential  resistance  (NDR)  region,  based  on  die  phase-space  distribution  function  formalism. 
In  the  stabk  cme,  die  equivalent-circuit  ^qiroach  (ECA)  is  shown  to  diaracterize  die  conqikx 
high-frequency  bdhavior  cf  a  RTD*s  response  to  small  a.c.  signal.  The  ECA  is  found  to  be  very 
useful  in  resolving  the  various  outstanding  controversies  concerning  the  dynamical  quantum 
tran^rt  behavior  of  RTD.  For  the  unstabk  case,  nonperturbative  approaches  are  outlined. 


I.  Introduction 

There  is  much  confusion  in  the  literature  concerning  the  high-frequency  behavior  of  resonant 
tunneling  devices  (RTD).  The  numerical  simulation  of  Frensky  [1],  using  Ae  Wigner  distribution- 
function  transport  equation  (WDFTEQ),  reveals  a  c^achive  behavior  at  lower  frequencies, 
eventually  changing  into  an  inductive  behavior  at  hi^r  frequencies.  In  contrast,  the  numerical 
simulation  of  Kluksdahl,  et  al.  [2]  using  a  similar  WDFTEQ  qiproach  reveals  an  inductive 
behavior  at  low  frequencies,  changing  into  a  ciqiacitive  bdiavior  at  intermediate  frequencies,  and 
eventually  changing  back  iitto  a  (somewhat)  inductive  behavior  at  hij^  frequencies.  Bodi  of  diese 
simulations  do  agree  concerning  the  real  pait  of  the  admittance,  namely,  it  is  negative  at  lower 
frequencies  and  becomes  positive  at  high  frequencies.  Their  results  also  concur  at  hi^  frequencies 
by  having  a  cutoff  in  the  admhtaiKe.  Until  now,  die  discrepancy  in  die  reactive  behavior  cited 
atove  has  remained  unresolved. 

So  far,  diere  are  no  other  serious  work  on  the  subject  of  characterizing  die  high-frequency 
reqionse  of  RTD  to  a.c.  signal,  although  diere  have  been  a  number  of  attenqits  to  analyze  the 
high-frequency  bdiavior  of  nanometric  structures.  These  are  attempts  to  extend  die 
Landauer-Buttiker  viewpoint,  which  calculates  conductance  from  transmission  coefficknts,  to 
high  frequencies.  The  gross  defickncies  of  all  diese  attempts  have  been  discussed  by  Landauer 
[3].  There  are  also  attempts,  for  exanqik  by  Fu  and  Dudky  [4],  iihidi  employ  die  machinery  of 
the  linear  re^nse  theory,  however  diis  particular  approa^  is  plagued  by  its  iruhility  to  treat 
far-from-equilibrium  operating  bias  conditions,  of  interest  to  ^  nanoekctronics  community. 
More  recent  results  of  Cai  and  Lax  [S]  employ  a  nonperturbative  time-dq>endent  Cfreen's  function 
approadi  to  conqiute  die  bdiavior  of  an  electron  incident  from  an  energy  channel  on  a  dcnibk- 
barrkr  structure.  Their  results  reveal  die  presence  of  intrinsic  oscillations  in  response  to  a  voltage 
pulse  across  the  negative  differential  resistance  (NDR)  region,  similar  to  die  result  obtained  by 
Buot  and  Jensen  [6],  particularly  vihen  self-consistency  was  switched  off  in  dieir  WDFTEQ 
calculations.  The  results  of  Cai  and  Lax  are  hoover  limited  to  singk-diannei  and 
nonselfconsistmt  contributions. 


85 


The  purpose  of  diis  pi|)er  is  twofold;  to  show  that  a  coherent  and  unified  undmtanding  of  the 
high-frequency  behavior  of  RTD  arises  from  the  equivalent-circuit  model  of  Buot  and  Joisei  (BJ) 
[7],  to  be  referred  here  as  the  BJ  model  (originally  pn^sed  to  study  the  nonlinear  a^iecu  of 
RTD  which  arises  from  die  selfconsistent  treatment  of  WDFTEQ),  and  to  point  out  a  proper  w^ 
to  a  noiqierturbative  theoretical  framework  for  analyzing  the  high-frequency  behavior  of  resonant 
tunneling  devices  within  the  quantum  distribution-function  approadi.  It  is  worthwhile  to  stress  the 
first  point  here  since  there  have  been  misleading  statements  in  the  literature  claiming  that  no 
understanding  exists  for  quantum  transport  at  nonzero  frequencies.  Also  there  is  a  need  to  clarify 
a  statement  made  in  the  literature  which  strongly  implies  that  no  LRC  circuitry  can  simulate  the 
complicated  frequency  dqiendence  of  the  admittance  in  the  linear-response  regime,  particularly 
since  this  statement  is  ma^  widuHit  further  reservations.  We  will  here  show  that  ind^  die  two 
earlier  luimerical  simulations  mentioned  above  [1,2]  nu^  be  recast  in  terms  of  the  BJ  model  in 
two  sqiarate  regimes  in  die  BJ  circuit  parametrization,  operating  in  the  NDR  region.  Thus,  die 
BJ  model  is  found  to  include  the  high-frequency  behavior  of  RTD. 


II.  Equivalent-Circuit  Model 

The  BJ  model  is  derived  from  the  selfconsistent  quantum  distribution-function  (QDF)  transport 
simulation,  with  four  independent  equivalent-circuit  parameters,  namely,  series  resistance,  R, 
negative  conductance,  G,  quantum  inductance,  L,  and  capacitance,  C.  The  RTD  simulated  by  BJ, 
with  fixed  bias  applied  in  the  NDR,  is  characterized  by  the  following  inequalities  for  the  circuit 
parameters:  R  |G|  >  1  and  RC  >  L|G| .  These  are  conditions  for  the  presence  of  oscillatory 
behavior  and  instability  of  die  operating  point  in  the  middle  of  the  NDR  region  [7]  leading  to  dw 
intrinsic  bistability  and  hysteresis  in  the  current-voltage  characteristics,w^ich  was  Ae  focus  of  die 
original  BJ  model.  In  contrast,  we  now  find  that  the  RTD  simulated  by  Frensky  [1]  and 
Kluksdahl,  et  al.  [2]  are  obtained  if  the  following  inequalities  are  obeyed:  R  |G|  <  1,  and  RC 
<  L I G I ,  along  with  furdier  conditions  to  be  specified  in  wdiat  follows.The  above  are  conditions 
for  the  nonoscillatoty  and  stable  operation  in  die  middle  of  the  NDR  region  allowing  for  the 
applicability  of  linear-response  ^roximation  [8].  They  also  imply  diat  the  series  resistance  is 
quite  small  compared  to  that  of  Ae  RTD  simulated  by  BJ  (note  Aat  the  RTD  simulated  by  BJ, 
although  having  approximately  the  same  feature  sizes  for  the  double-barrier  region,  differs  in 
having  an  undoped  buffer  layer  on  bodi  sides  of  die  double  barrier,  w4iich  nu^  also  affect  the 
value  of  the  capacitance  relative  to  diose  of  Refs.  [1]  and  [2],  and  a  longer  conqiutational  box 
length,  w^ich  togedier  introduces  a  larger  R).  Moreover,  the  RTD  simulated  by  Frensley  [1], 
widiout  taking  self-consistency  into  account,  differs  from  that  simulated  by  Kluk^ahl  et  al.  [2], 
which  takes  self-consistency  into  account,  by  the  following  inequality:  L|G|  <  C/|G|  for 
Frensley  [1],  whereas  L|G|  >  C/|G|  for  Kluksdahl  et  al  [2].  Ihese  imply  a  larger  quantum 
inductance  for  the  RTD  of  Ref.  [2]  as  compared  to  diat  simulated  in  Ref.  [1],  where  the  residual 
small  inductance  is  mainly  due  to  the  quantum  nonlocality  [8].  Further,  note  that  Ref.  [2]  uses 
wider  barrier  widths  than  diose  used  in  Refs.  [1]  and  [7].  Thus  the  BJ  circuif  model  serves  not 
only  to  clarify  the  origin  of  the  discrepancy  of  the  results  of  the  two  numerical  simulations  [1,2], 
but  also  to  bring  togedier  the  various  dynamical  aspects  of  RTD  in  terms  of  WDFTEQ. 

In  addition  to  die  quantum  inductance,  in  parallel  with  die  aq;iacitance,  found  by  BJ  for  fixed  bias 
in  the  NDR  region,  it  is  important  to  tike  also  into  account  effects  arising  from  die  electron 
kinetics  in  calculating  die  RTD  response  to  high-frequency  a.c.  signal.  These  effects  are  often 
referred  to  as  due  to  die  electron  inertia,  i.e.,  it  takes  time  for  the  electron  to  be  accelerated  and 
decelerated,  typically  causing  the  current  to  lag  in  time  bdiind  die  electric  field.  These  inertial 
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eflbco,  iWiough  ahnqrs  present,  are  negligible  « low  fre<piencies.  b  is  taken  into  acootmt  in  our 
iiigii*freqiienqr  equivalent  circuit  model  of  RTD  by  adding  another  inductance,  f ,  (f  is  typically 
n  order-of-magnitiide  smaller  than  the  quamum  inductance.  L)  in  series  with  R,  outside  of  the 
two-teanch  circuit  of  L,  G,  and  C.  As  will  be  shown  in  the  following  figures,  diis  additional 
"inertial  inductance",  f ,  serves  to  cut  off  the  RTD  response  at  very  high  frequencies. 

We  have  calculated  the  admittance  of  RTD  to  a  small  a.c.  signal  using  the  above-meuioned  high- 
frequency  equivalent  circuit  model.  The  different  circuit  parameters  are  estimated  by  adjusting 
die  values  obtained  by  BJ  to  conform  with  die  constraints  enumerated  above  appropriate  to  the 
different  RTD  sinuilai^.  The  results  are  displayed  in  Fig.  1  for  the  RTD  parameters  appropriate 
to  the  device  simulated  by  Frensley  [1],  and  Fig.  2  for  the  RTD  parameters  iqipropriate  to  the 
device  simulated  by  Kluksdahl  et  al.  [2]  (note  that  the  inductive  part  of  the  admittance  is  plotted 
with  a  positive  scale  in  Ref.  p]).  Observe  that  the  characteristic  features  of  die  results  of  their 
numerical  simulations,  mention^  at  the  beginning,  are  well  reproduced  by  the  present  results, 
(fotained  simply  by  using  the  high-frequency  equivaiem-circuii  model.  Inde^,  di^  results  lend 
further  support  to  the  accuracy  of  the  U  equivalent-circuit  model  of  RTD  at  low  frequencies  [7], 
as  well  as  confirm  the  validity  of  the  present  high-frequency  equivalent  circuit  model  of  RTD 
which  incorporates  a  series-inertial  inductance.  These  results  also  serve  to  invalidate  the  claim  [4] 
made  without  any  reservation,  that  no  LRC  circuit  can  simulate  die  conqilicated  frequency 
dependence  of  the  admittance  in  the  linear-response  regime. 


ni.  Nonperturbative  Approaches  for  Unstabk  NDR  Operation 

For  unstable  operation  in  the  NDR  region,  nonper&irbative  approaches  are  more  tqipropriate. 
Here,  a  new  time-dependent  transformation  of  phase  space  is  found  v4iich  transform  die  quantum 
distribution  (QDF)  transport  equation  to  the  same  form  in  the  absence  of  time-dependent  signal. 
This  is  shown  to  be  a  veiy  powerful  approach  in  revealing  the  nature  of  the  nonperturbative 
response  to  a  small  a.c.  sign^.  This  time-dependent  transformation  is  useful  when  the  applied 
time-dependent  electric  field  is  assumed  to  be  position-indqiendent.  The  general  formulation  of 
the  QDF  transport  equation  in  die  presence  of  space  and  time-dqpendent  potential  is  given  by  one 
of  the  authors  [6,9].  For  high-frequency  signals  applied  at  die  drain  terminal  of  an  RTD, 
following  die  conventional  procedure,  we  assume  a  time-dqiendent  but  position-independent 
applied  perturbing  field  within  die  device.  From  the  general  formula  given  in  Refs.  [6.9],  one 
arrives  at  a  resulting  QDF  transport  equation  in  response  to  this  signal.  We  show  that  die 
resulting  QDF  transport  equation  can  be  transformed  into  the  form  of  the  equation  obtained  earlier 
in  the  absence  of  die  perturbing  time-dependent  field,  dirough  a  time-de^ndent  transformation 
of  the  phase-space.  This  transformation  of  the  equation  however  makes  the  double-barrier 
potential  to  be  time-dependent.  With  this  sinqilification,  the  application  of  the  numerical  technique 
of  Ref.  [6]  widi  known  time-dqiendent  matrix  for  dM  double-barrier  ptnential  operator,  with  time 
as  a  parameter,  becomes  feasible.  The  derivation  of  die  transform^  and  simplified  new  QDF 
transport  equation,  and  die  method  of  the  numerical  inqilementation  for  calculating  the 
nonperturbative  response  to  an  a.c.  signal  will  be  discussed  in  a  fordicoming  paper. 

In  a  more  general  and  realistic  situation,  an  sqiplied  time-dependent  voltage  at  the  drain  will 
selfconsistently  lead  to  a  time-dependent  and  position-dependent  potential  inside  die  device.  For 
diis  general  situation  of  die  unst^le  case,  a  nonperturbative  approach  based  on  die  structure  of 
idiase  space  is  proposed.  We  introduce  two  different  representations  of  quantum  transport, 
namely,  die  Liouville  representation,  and,  die  phase-space  fluid  rqiresentation.  The  QDF  is 
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Fig.  1  -  KmI  (nlid  eiirv*)  aad 
^■M>gi^■ry  (dMhad  curve)  parti 
d  dw  iduiitfife  (in  units  ci 
lO’’  S/ca?)  for  tte  cticuit 
pifnuma  conJunakn  ant 
correipondi  to  Ihi  RTD 
■inuilmil  by  Frmriey  [1]: 
R|G|  <  I.RC  <  L|OU«m1 
L|0|  <  C/|0|.  TheveluM 
mid  ate:  R  ■■  0.2  X  10~^  0 
cni*,  O  ■  -2.5  X  lO^  S/ca?, 
C  -  6.0  X  10-’  F/cn^, 

L  >  0.8  X  10~2  H/cn?,  and 
«  -  0.2  L. 


11  10^2  1013  101^ 

FREQUENCY  (HZ) 


Fig.  2  •  Real  (solid  carve)  and 
iiwaginary  (dashad  carve)  parts 
of  ika  sdmi  Wire  (in  units  of 
10^  S/cai*)  fcr  the  drcait 


oonespoods  to  the  RTD 
sianilsted  1^  Kluksdahl  at  al. 
[2]:R|0|  <  1,  RC  <  L|0|. 
L|0|  >  C/|0|  .  lie  values 
used  are:  R  -  1.3  X  10-^  0 
cm*,  O  “  -0.5  X  lO*  S/cni*, 
C  -  0.4  X  10-‘  F/cm*, 

L  -  9.0  X  10-*H/cm*.aiid 
f  -  0.05  L. 


solved  numerically  in  die  same  fashion  as  described  and  successfully  implemented  in  the  exiting 
Uteratuie,  but  using  time^iqiendent  bias.  The  QDF  solution  has  inhment  und^irriile  features  for 
studying  the  dynamics  of  phwqiace.whidi  can  be  eliminated  by  a  special  post-{Hrocessing.  This 
post-processing  yields  a  smoothed-out  QDF,  the  positive  definite  Husimi  distributkm,  and  allow 
us  to  study  the  structure  of  quantum  tnyectories  in  phase  space.  It  is  suggested  diat  the  use  of 
Husimi  distribution  enables  a  microscopic  dynamical  viewpoint  of  ECA  as  well  as  shed  further 
li^  on  the  dynamical  nature  of  the  quantum  inductance.  The  full  details  of  this  approach  will 
be  discussed  in  anodier  paper. 


IV.  Coochiding  Remarks 

In  conclusion,  the  utility  of  the  QDF  approach  for  understanding  the  high-frequency  behavior  of 
quantum-based  devices  and  in  deducing  the  equivalem  circuit  model  of  RTD  have  been 
demonstrated  here.  It  is  worthvdiile  to  reiterate  diat  the  equivalent  circuit  model  of  RTD  is  subtle 
and  in  the  form  presented  here  elucidates  die  two  earlier  simulations  [1,2]  as  special  cases  of  die 
model  presented  here.  Thus  die  discrepancy  in  the  simulated  reactive  briiavior  mentioned  in  the 
opening  paragraph  is  resolved.  Moreover,  Ae  exact  role  of  electron  inertia  in  die  high-frequency 
b^avior  of  RTD  is  clarified.  For  cases  when  the  linear  reqwnse  approximation  breaks  down, 
nonperturbative  schemes  are  outlined. 
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Abstract 

A  niethod  for  directly  solving  the  coupled  Poisson/Boltznnann  Transport  equation  including  a 
new  model  for  the  collision  term  is  presented.  This  method  yields  the  complete  momentum 
distribution  function  from  which  all  quantities  of  interest  to  the  device  modeller  can  easily  be 
calculated.  Current  continuity  is  implicit  in  the  model.  The  method  is  demonstrated  by  simu¬ 
lating  an  n'^-n-n'^  silicon  structure. 


I  Introduction 

With  decreasing  device  dimensions  there  is  a  need  to  accurately  model  the  effects  of  hot  carri¬ 
ers.  Solutions  of  the  drift-diffusion  and  hydrodynamic  tiKxiels  have  been  widely  used  to  model 
the  behaviour  of  many  submicron  devices  [1].  However,  in  the  analysis  of  the  programming 
of  EPROMs  [2]  and  many  other  phenomena  found  in  modem  small  geometry  devices,  the 
complete  carrier  distribution  function  in  both  ordinary  space  and  momentum  space  is  required 
for  accurate  simulation.  Recently  there  has  been  increased  interest  in  solving  the  Boltzmann 
Transit  Equation  (BTE)  to  obtain  the  distribution  function  [3, 4],  Knowing  the  distribution 
function  and  electric  potential,  all  other  quantities  of  interest  can  be  calculated.  In  this  paper  a 
method  for  solving  the  coupled  Poisson/Boltzmann  Transport  equations  is  presented. 


n  Mathematical  model 


The  BTE  for  steady  state  conditions  for  a  semiconductor  device  is 


v.V,/-p.V*/ 


-{%) 


(1) 


COLL 


(2) 


where  v  is  the  carrier  velocity,  E  is  the  electric  field  and  f  is  the  distribution  function.  The  right 
hand  side  of  equation  1  is  the  collision  term  and  is  often  approximated  using  the  relaxation  time 
approximation  [5]  as 

^  /  COLL 

where  fo  is  the  distribution  function  at  equilibrium  and  r  is  the  relaxation  time.  Integrating 
equation  1  over  momentum  gives  the  current  continuity  equation  in  the  abscence  of  generation- 
recombination.  However,  if  the  relaxation  time  approximation  is  used  the  integral  of  the  collision 
term  over  momentum  is,  in  general,  not  equal  to  zero  because  the  electron  concentration  differs 
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frmn  equilibrium  to  non-equilibrium  conditions.  Thus  the  relaxation  time  iqiproximation  does 
not  satisfy  the  condition  of  current  continuity,  and  is  therefore  invalid  for  non-equilibrium 
coodititms. 

A  new  ^>im>xinuuion  is  made  to  the  collision  term  of  the  BTE 


X^/COLL  ’’ 


(3) 


J  g(k)dk  =  (4) 

whoe  g(ib)  is  a  Gaussian  function  in  A  and  n  is  the  carrier  concentration.  The  carrier  concentration 
is  given  by 

»  =  (5) 

If  r  is  a  function  of  ordinary  space  only,  then  the  integral  of  equation  3  over  momentum 
q>ace  is  always  equal  to  zero,  llierefore  this  approximation  always  satisfies  the  condition  of 
current  continuity.  Gose  to  equilibrium  the  new  approximation  approaches  the  relaxation  time 
approximation  because 

g(k)n  =  foik)  (6) 

at  equilibrium. 


in  Discretization  in  one  spatial  and  one  momentum  dimension 

The  discretizatian  of  the  governing  equations  in  all  three  spafial  and  three  momentum  dimen¬ 
sions  would  lead  to  a  very  large  system  of  equations  to  be  solved  simultaneously.  Therefore 
only  one  spatial  and  one  momentum  dimension  are  included.  Poisson’s  equation  is  discretized 
in  one  spatial  dimension  using  finite  differences.  The  BTE  is  discretized  in  one  spatial  and  one 
momentum  dimension  using  finite  differences,  the  gradients  being  approximated  using  central 
differences.  Vifith  just  one  spatial  and  one  momentum  dimension  it  is  assumed  that  the  distri¬ 
bution  of  carriers  in  the  other  two  momentum  dimensions  is  the  same  as  at  equilibrium  and 
that  collisions  only  scatter  carriers  forwards  or  backwards  in  that  one  momentum  dimension. 
The  velocity  is  modelled  on  a  parabolic  band  structure.  For  simplicity  the  BTE  and  Poisson’s 
equation  are  solved  using  Gummel’s  method  [S]  subject  to  appropriate  boundary  conditions. 


IV  Boundary  conditions 

The  contacts  are  taken  to  be  Ohmic.  The  potential  at  the  contacts  is  set  equal  to  the  sum  of  the 
iqrplied  voltage  and  the  built  in  potential.  I^r  the  BTE  a  Dirichlet  boundary  condition  is  imposed 
at  the  extremities  of  nnomentum  and  a  natural  boundary  condition  is  applied  at  the  contacts.  At 
the  maximum  and  minimum  values  of  momentum,  the  distribution  function  is  set  equal  to  the 
equilibrium  distribution  function.  At  the  ctmtacts  the  gradient  of  the  distribution  function  is  set 
equal  to  zero  assuming  thoe  is  a  neutral  q>ace  charge  region  close  to  the  contacts. 
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V  Results 

The  model  is  applied  to  simulating  the  operation  of  a  lightly  doped  n'^'-n-n'*'  diode.  The  n'*^-n-n'*^ 
diode  has  an  ovorall  length  of  10  microns  with  the  outer  2.5  microns  at  each  side  doped  at  10’^ 
cm~^.  The  S  micron  region  in  the  centre  is  doped  at  10^^  cm~^.  The  contact  at  the  right  is 
grounctod  and  a  bias  is  applied  to  the  left  contact  Figure  1  shows  the  potential  distribution  across 
the  (kvice  for  an  iqpplied  bias  of  400  mV.  Figure  2  shows  the  electron  concentration  under  the 
same  conditions.  Tte  distribution  function  at  the  right  contact  is  shown  in  figure  3  along  with 
the  distribution  function  where  the  carrier  concentration  is  lowest  (3.5  /rm  from  the  left  con¬ 
tact).  The  greater  displacement  from  equilibrium  of  the  distribution  function  where  the  carrier 


k  (xloV*) 


''Figure  3:  Shape  of  distribution  function 

concentration  is  lower  corresponds  to  a  greater  average  energy  of  the  carriers  towards  the  centre 
of  the  device.  Figure  4  shows  the  average  energy  of  electrons  at  300K  across  the  device.  Note 
that  the  carriers  a  distance  2.5  microns  from  the  right  contact  (barrier  region)  have  an  average 
energy  lower  than  the  lattice  thermal  energy,  a  phenomenon  which  occurs  when  carrier  diffusion 
is  in  the  same  direction  as  the  external  force  on  the  carriers  [6].  Current  continuity  is  implicit  in 
the  model  and  is  observed  for  all  applied  biases.  Current  density  in  ths  device  is  easily  calcu¬ 
lated  from  the  momentum  distribution  function.  The  current  density  versus  applied  bias  for  this 
structure  is  shown  in  figure  5.  This  result  is  identical  to  diat  predicted  by  the  drift  diffusion  model. 
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Figure  4:  Average  electron  energy  for  a  bias  of  400  mV  applied  to  the  left  contact. 


Figure  5:  Current  density  versus  ^lied  voltage. 
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Abstract 

A  new,  quasi-two-dimensional  physical  device  model  was  developed  which  allows  the  DC 
small-  and  large-signal  microwave  characteristics  of  HEMTs  and  MESFETs  to  be  obtained 
based  on  device  geometry  and  process  data.  It  is  easily  tq)plied  to  a  wide  variety  of  HEMT 
structures  including  pHEMT,  AlGaAs/GaAs  and  multichaimel  structures. 


L  Introduction 

Field  effect  transistor  physical  models  can  be  used  in  several  electronic  engineering  areas, 
including  device  and  circuit  design.  Analytical  models,  due  to  there  simplicity,  can  not 
reliably  predict  device  performance  wdiereas  two  dimrasional  models  (both  hydrodynamic  and 
Monte  Carlo)  are  certainly  predictive  and  usually  accurate,  but  computationally  very 
demanding.  Quasi-two-dimensional  (Q2D)  modelling  offers  a  good  compromise  as  it  is  up  to 
1000  times  faster  than  full-two-dimensional  modelling  and  still  reasonably  accurate.  In 
practice  I-V  characteristics  and  microwave  S  parameters  can  be  obtained  in  a  few  minutes  on 
a  personal  computer  with  a  80486  processor.  Our  pro^am  can  also  determine  simultaneously 
the  elements  of  non-linear  quasi-static  equivalent  circuit  model.  S  parameters  are  calculated 
using  a  time-domain  version  of  the  physical  model. 

The  Q2D  approach  is  based  iq>on  die  fact  diat  the  equipotential  Mnes  in  the  active  charmel 
(i.e.  outside  Ae  depleted  region)  of  a  HEMT  or  MESI^T  are  fairly  parallel,  perpendicular  to 
die  free  surface.  Therefore  die  analysis  of  die  device  can  be  subdivided  into  establishing  the 
charge  control  law  by  the  gate  in  die  direction  perpendicular  to  die  gate  (y  direction)  and 
calculating  die  current  transit  in  the  direction  parallel  to  the  gate  (x  direction). 

n.  Charfc  Control  Law 

In  die  charge  control  model  a  vertical  cross-section  of  the  device  is  analysed  (considered  as 
a  Sdiottky-diode).  The  calculation  of  die  sheet  electron  and  ionised  donor  concentrations  are 
Khieved  by  self-consistendy  solving  Poisson's  equation  with  Fermi  statistics  and  die 
Schrbdinger  equation.  Fast  convergence  can  be  achieved  using  Newton's  algorithm.  Up  to  nine 
Eigen  energies  can  be  evaluated  when  solving  die  Schrddinger  equation  though  for  practical 
reasons  usually  only  two  or  diree  are  calculated,  vriiile  the  rest  of  the  electrons  are  treated  as 
those  belonging  to  the  three-dimensional  electron  population.  When  the  quantum  well  is  not 
very  deq>,  as  is  often  the  case,  it  is  possible  to  by-pass  die  quantum  mechanical  calculations 
which  makes  the  simulatimi  even  faster. 

An  advantage  of  this  tqiproach  is  accuracy  and  gmerality,  as  opposed  to  the  triangular  well 


w  square  well  approximations,  which  usually  contain  m  enq>irical  fitting  parametm^  in  order 
to  improve  accuracy  [1]. 


At  the  end  of  calculations  a  look-iq>  table  is  produced  which  cmitains  the  gate  voltages  and 
die  sheet  electron  and  ionised  dcmor  ccmcentratkms  in  each  epitaxial  li^er.  In  case  of 
pseudomorphic  structures  the  effect  of  strain  m  the  band  gap,  conductimi  band  edge 
discontinuity,  dielectric  constant  and  effective  mass  is  calculated. 


gate  width:  1.0  mm 
gate  length:  0.3  pm 
source  gate  distance:  1 .0  pm 
gate  drain  distance:  1.3  pm 
layer  structure: 

AlGaAs(electron  supplying  liyer):  30  nm,  A1  content:  20%,  donor  concentratimi:  1.2xl0^^m'^ 
AlGaAs  (^acer  l^rer):  3  nm.  A1  content:  20%,  donor  concentration:  IxlO^m'^ 

InGaAs  (active  li^er):  18  nm.  In  content:  15%,  donor  concentration:  1x1  O^m*^ 

GaAs  (buffer  layer):  ISO  nm,  donor  concentration:  IxlO^m'^ 


Table  1.:  The  structure  and  properties  of  die  simulated  pHEMT  device 

For  a  pseudomorphic  AlGaAs/InGaAs/GaAs  HEMT  (see  Fig.l.  and  Table  1)  the  conduction 
band  edge  and  the  electron  concentration  is  shown  in  Fig.2  and  Fig.3  for  zero  gate  bias.  The 
dependence  of  sheet  electron  and  ionised  donor  concentration  on  the  voltage  applied  to  the 
Sdiottky-diode  is  shown  in  Fig.4.  It  can  be  seen  that  at  negative  gate  biases  (close  to  pinch- 
off)  the  sheet  electron  concentration  is  several  orders  of  magnitude  lower  dian  in  Fig.3.  . 


The  quantum  mechanical  calculations  ware  not  by-passed  in  the  present  simulation  (in  favour 
of  Fermi  statistics),  as  it  would  have  led  to  about  20%  overestimation  of  die  dieet  election 
concentration  in  the  InGaAs  li^er  and  10%  ovwestimation  of  die  total  sheet  electron  density. 
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Figure  2.  Energy-band  diagram  of  a  vertical  section  below  the  gate  of  the  pHEMT 
in  Table  1. 


Figure  3.  Electron  density  for  the  cross-section  of  a  pHEMT  corresponding  to  die  conditions 
shown  in  Figure  2. 


m.  Chanatl  SiaMdaSar 

In  die  channel  simulator  Poisson's  equation  and  the  tran^rt  equations  are  solved  self- 
consistandy  in  die  x-direcdon  assuming  the  gradient  of  die  Fermi  level  to  be  tme-dimensional. 
The  tnm^rt  equations  are  simplified  versions  of  the  first  diree  moments  of  the  Boltzmaim 
equation,  die  particle  (current  continuity),  momentimi  nd  energy  conservation  equations  [2]. 
T^  trnms  describing  die  scattering  of  free  dectrons  are  calculated  based  on  analytical  fits  to 


Monte  Cerio  stmulation  results  [3],  and  are  all  CMisidered  to  be  the  function  of  average 
electron  energy. 


i 
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Figure  4.  Charge  control  law:  sheet  dectron  concentratiMi  and  ionised  donor  concentration 
in  each  li^er 


The  sheet  election  and  ionised  donor  concentration  is  found  from  the  above  look*up  table, 
based  on  the  difference  of  channel  and  gate  (or  surface)  potential,  with  a  correction  arising 
from  the  gradient  of  electric  field  in  the  x-directi<m.  As  this  term  can  be  quite  large,  especially 
around  the  drain  end  of  the  gate,  more  accurate  calculations  diould  use  a  double  subscripted 
look-up  table  for  die  dieet  charge  densities  to  include  die  grad(EJ  dependence.  Due  to  the 
large  memory  requirement  diis  approach  is  more  suitable  for  simulations  running  <m 
workstations  than  on  PCs,  therefore  an  analitic  qiproximadon  was  used  in  diis  work. 

Unlike  odier  recendy  published  models,  our  model  can  simulate  parasitic  MESFET 
conduction,  allowing  a  more  accurate  representadcm  of  device  operation.  In  fact,  HEMT  and 
MESFET  devices  widi  an  arbitraiy  number  of  different  layers  can  be  simulated. 


The  calculations  at  a  specified  source  current  proceed  from  source  to  drain  using  a  simple 
forward  differencing  scheme,  after  omitting  die  diffusion  term  in  the  momentum 
conservadmi  equation  based  on  die  consideradons  in  [2].  The  combination  of  Poissons's 
equation  and  the  transport  equations  yields  a  quadratic  equation  for  the  electric  field  or  the 
drift  velocity.  The  drain  voltage  corresponds  to  die  diannel  potential  at  die  drain  and  is 
obtained  integrating  die  electric  field  along  die  channel  (Fig.S).  The  I-V  characteristics  of 
die  pHEMT  is  diown  in  Fig.6. 
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Figure  S.  The  electric  field,  channel  potmtial,  average  electron  anergy  and  drift  velocity  in 
the  conduction  channel  at  bias  point:  V„  »  *0.4  V,  l.OV 
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Figure  6.  DC  charactenstics  of  the  simulated  pHEMT  device 
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Abstract 

The  advent  of  low  cost  high  performance  workstadtms  and  new  visualisation  mols  is  coincident 
with  a  growth  of  new  problems  in  electronics  ranging  from  large  scale  power  devices  to 
nanometer  dimensions.  The  rise  of  molecular  electronics  and  bioelectronics  points  to  new 
regimes  for  the  device  modeller  to  explore.  It  is  argued  that  some  software  discipline  especially 
standards  and  potability  is  required  if  the  modelling  community  is  to  be  effective.  Lessons  may 
be  drawn  from  successes  in  other  fields  such  as  quanmm  chemistry  and  molecular  nradelling. 

I.  Introduction 

Just  seven  years  fton  the  twenty-first  century  it  is  perhaps  a  good  time  to  reflect  on  the  future 
of  computational  electronics.  The  modelling  and  simulation  of  solid-state  devices  is  at  an 
important  watershed  in  its  history  because  for  the  first  time  it  is  possible  to  imagine  universal 
access  to  software  and  mathematical  techniques  which  will  allow  any  experimentalist,  theorist, 
industrialist,  whatever,  access  to  knowledge  which  was  at  one  time  the  select  speciality  of  a 
few  groups  around  the  woiid.  The  changes  are  being  brought  about  by  the  microelectronics 
revolution  which  now  provides  the  power  of  1980s  mainframes  and  early  supercomputers  in 
relatively  cheap  wmkstations  -  with  promises  of  more  to  come.  There  is  a  similar  change  in 
software  habits;  in  the  last  few  years  industrial  employers  of  our  engineering  and  science 
graduates  have  begun  to  expect  proficiency  in  languages  such  as  C  rather  than  the  tradititmal 
FORTRAN;  and  some  nodding  acquaintance  is  now  expected  witii  the  concept  of  “user- 
friendliness”.  It  is  no  longer  fashionable  to  It^  up  hours  of  Cray  time  to  produce  a  couple  of 
numbers.  There  are  expectations  aroused  that  something  better  is  required.  Explanations, 
interpretations,  insight,  visualisation,  projectitms  are  required  quickly,  graphically  and  easily. 
Computer  scientists  are  devoting  time  to  tq;ncs  such  as  persistent  programming  which 
addresses  the  issues  of  25  year  old  and  older  software.  Computation  is  now  one  strand  of  more 
c(Miq)lex  activities;  any  computed  data  is  to  be  regarded  as  part  of  a  rich  interacting  hierarchy  of 


knowledge  which  can  be  used  by  different  people  for  different  reasons.  In  computational 
electronics  we  have  only  just  begun  to  realise  what  these  changes  might  entail  and  it  is  the 
purpose  of  this  paper  to  give  a  personal  view  of  what  we  have  to  look  forward  to.  1  want  to 
start  by  first  looking  back  to  the  beginnings  of  the  subject(for  me  anyway)  in  the  1960s. 

2.  Some  history 

Coincidentally,  it  is  also  a  quarter  of  a  century  since  the  first  conference  on  computational 
electronics  held  at  the  Culham  laboratories  in  1968.  At  that  meedng  were  some  of  the  first 
papers  on  computational  electron  transport  theory  inclu'''>g  one  by  myself  and  Cliff  Hearn 
which  dealt  with  self-scattering  in  a  Monte  Carlo  simulation  of  photo  excited  hot  electrons  and 
which  used  propagator  techniques  to  prove  that  the  Monte  Carlo  solutions  were  equivalent  to 
solutions  of  the  Boltzmann  equation.  At  that  time  Culham  was  a  centre  for  computational  wc»ic, 
mainly  due  to  the  intense  interests  in  plasma  physics  and  to  a  lesser  extent  neutron  transport 
theory.  Culham  provided  an  educational  distribution  point  with  series  of  weekly  lectures  on 
computational  physics  oiganised  by  Potter.  The  smallness  of  the  computers  of  those  days  made 
the  use  of  compact  ingenious  algorithms  the  order  of  the  day  rather  than  brute  force  number 
crunching.  It  is  not  surprising  that  Potter  went  on  to  develop  the  Psion  organisers  and  other 
compact  systems. 

By  1%9  much  of  the  familiar  theoretical  base  of  ctmiputational  electronics  was  already  in  place. 
JFET  theory  had  been  set  up  in  1947,  Bipolar  transistor  theory  by  1952,  MOSFET  theory  by 
1960,  MESFETs  in  1966.  Of  the  computational  techniques,  drift-diffusion  theories  dated  from 
1952,  hydrodynamic  models  from  1962,  the  Gumtnel  algorithms  from  1964  and  the  Erst  solid 
state  electron  transport  theory  handled  by  Monte  Carlo  was  reported  by  Kurosawa  in  I966(the 
latter  work  was  part  of  the  inspiration  that  guided  me  into  transport  theory  especially  hot 
electron  theory  in  1967).  But  many  of  these  techniques  were  really  derivative:  Monte  Carlo 
studies  were  enjoying  extensive  use  in  neutron  transport  theory  (although  Lord  Rayleigh  was 
doing  much  the  same  thing  without  computers  in  the  last  century);  and  the  influence  of 
hydrodynamics  (particularly  driving  the  aerospace  industries  from  WWII  onwards)  and  plasma 
physics  cannot  be  under-estimated.  There  were  already  major  forays  into  quantum  kinetic 
theories  although  these  were  still  of  a  formal  rather  than  practical  nature. 

What  really  limited  practical  triplications  was  computing  power .  The  really  interesting  (as  well 
as  applicable)problems  were  either  highly  complex  or  non-linear  or  both.  Looking  back  I  am 
still  amazed  by  the  lack  of  imagination  we  all  had.  “You  are  only  limited  by  your  imagination” 
is  the  proud  boast  of  many  a  computer  advertisement  in  the  1990s.  Well,  my  limit  was  a  factor 
between  3  and  10.  In  1967 1  developed  code  (ALGOL  60)  fen*  simulating  the  time  evolution  of 
Gunn  domains  and  similar  effects  in  photoconductors  using  an  Elliot  803  computer  with  32K 


core  memory.  A  little  later  the  offer  of  time  on  the  national  ATLAS  computer  at  Harwell 
brought  the  luxury  of  90K  menrary  but  entailed  a  two  day  wait  to  receive  the  wad  of  data 
(mainly  error  messages  and  core  dumps)  fix>m  the  post.  That  extra  factor  of  3  in  metiKH^ 
seemed  impressive,  and  every  few  years  we  in  the  computational  sciences  looked  fcxward  to 
that  extra  factm*  of  ten  w  so  which  would  bring  the  biggest  of  problems  into  our  control.  It  is 
perhaps  fortunate  that  the  compute  did  not  exist  because  the  contrd  and  detailed  understanding 
of  solid  state  device  materials  was  not  sufficent  fen*  realistic  nxxlelling.  Not  only  was  compute 
power  singularly  poor,  but  means  of  handling  the  output  data  were  primitive.  In  1968  I  spent 
maybe  one  week  at  a  time  collecting  plotter  pictures  on  large  rolls,  selective  frames  of  which 
were  placed  along  a  cenridor  to  facilitate  a  “computer  movie”  of  Gunn  domain  dynamics. 
Today,  that  entire  problem,  including  symbolic  manipulation  and  computer  movies  can  be 
handled  using  Mathematica  on  a  Macintosh  computer  in  a  few  minutes.  It  is  this  simple 
reminiscence  which  brings  home  the  immense  opportunities  that  we  now  have  for  unleashing 
computational  electronics  from  the  grip  of  computational  poverty.  But  others  have  been  there 
first:  the  computational  chemists  have  achieved  a  close  partnership  with  their  industry  which 
has  produced  an  effective,  widely  shared  approach  to  applicable  computational  science  which 
we  in  computational  electronics  could  learn  from. 

III.  Physical  modelling  versus  equivalent  circuit 

There  has  always  been  a  division  between  those  advocating  the  empirical  approach  to  device 
and  particularly  circuit  modelling  and  those  favouring  the  develq^ment  of  physical  nxxlels.  The 
advantages  and  limitations  of  the  two  approaches  are  well  known:  industry,  especially  given  the 
exponential  pace  of  integrated  circuit  technology,  has  needed  data  quickly  and  cost-effectively; 
empirical  or  equivalent  circuit  models  do  the  trick  -  in  the  shrat-term.  But,  that  approach  and  a 
poor  investment  in  physical  nxxlels  led  one  well-known  review  paper  in  the  late  1970s  to  state 
that  it  was  highly  unlikely  that  silicon  transistors  could  be  manufactured  with  design  rules  less 
than  10  microns.  It  is  also  true  that  physical  nxxlels  give  the  best  route  to  device  understanding 
and  to  devising  new  device  concepts  -  provided  the  underlying  physical  model  is  reliable.  The 
latter  case  is  still  a  problem  because  we  do  not  usually  know  how  to  model  surface  and 
interface  charge  very  readily.  Indeed,  the  whole  problem  of  physical  nxxlelling  of  real  devices 
is  bound  up  with  understanding  and  developing  models  of  the  underlying  technological 
processes. 

The  physical  nxxlellers  have  often  dealt  with  the  esoteric:  the  spate  of  quantum  “devices” 
investigated  in  the  1980s  would  have  achieved  short  shrift  with  the  empiricists  had  they 
followed  the  literature.  After  all  it  is  assumed  that  devices  should  be  interconnectible,  isolatable 
aixl  designed  to  function  at  room  temperature  not  milli-degiees  Kelvin.  Again  due  to  a  certain 
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innocent  over-familiarity  with  the  field  many  advocates  of  quantum  devices  fail  to  pdnt  out  the 
frequent  necessity  to  use  lock-in  amplifiers  in  operating  the  devices.  The  real  lesson  here  is  not 
that  weak  on  quantum  devices  is  only  of  academic  interest:  it  is  that  the  science  required  to 
achieve  engineering  goals  is  not  necessarily  the  same  science  that  drives  the  physical  scientist  to 
investigate  phenomena  within  devices.  This  problem  did  not  arise  with  the  quantum  chemists. 
Over  the  last  25  years  the  chemistry  community  has  mutually  developed  molecular  modelling, 
drug  design  and  molecular  g^phics/design  software  which  is  not  only  relevant  to  science  and 
industry  but  mixes  the  empirical  and  physical  to  advantage;  is  also  readily  available  and  widely 
used.  My  own  interests  in  molecular  electronics  have  led  me  to  have  a  healthy  respect  for  the 
sheer  accessibility  of  computational  techniques  and  databases  to  the  chemistry  community. 
There  are  distinct  signs  now  that  the  physical  modelling/empirical  industrial  rift  is  healing 
within  the  electronics  community.  This  is  coming  about  partly  because  of  better  control  and 
understanding  of  real  materials  and  the  need  to  include  “esoterics”  in  device  design  e.g.  hot 
electron  effects.  There  are  also  questions  of  “attitude”:  we  have  all  heard  of  the  physical 
scientists  easy  dismissal  of  engineering  problems  as  containing  no  interesting  physics.  Often 
this  is  misleading  and  in  our  own  recent  work  (with  Asenov  and  Brown)  on  applying 
(technology  transfer  ?)  microscopic  device  modelling  techniques  to  large  power  transistors  (2(X) 
A  switches)  we  have  developed  considerable  respect  for  the  complexity  and  subtlety  of  the 
underlying  physical  problem.  More  significantly,  the  immense  complexity  of  realistic 
quantitative  as  opposed  to  qualitative  physical  device  simulation  such  as  Laux’s  Monte  Carlo 
models  of  MOSFET  devices  is  becoming  manageable  with  the  advent  of  very  high  power 
workstations. 

IV.  Machine  power  I:  workstations 

Early  next  year  my  University  will  relinquish  its  three  mainfiames  (including  an  IBM  3090 
vector  processor)  in  favour  of  a  distributed  system  of  workstations  inter-connected  by  fast 
networking.  Other  Universities  here  and  abroad  have  already  experienced  this  trauma  -  and 
survived.  The  argument  is  compelling:  why  share  a  126  Mbyte  mainframe  with  400  other 
people  when  you  can  have  a  cheap  256  Mbyte  workstation,  with  instant  on-line  colour  graphics 
for  yourself .  For  a  little  more  money  one  can  expect  to  have  100  MFLOP  workstations  for  use 
within  modelling  groups  over  the  next  few  years.  If  the  dreams  of  modellers  from  the  1970s 
are  to  be  believed  this  already  is  extreme  overkill  for  computational  electronics  problems.  Of 
course  that  is  not  the  case,  but  it  is  the  case  that  this  development  should  lead  to  very  much 
closer  links  between  academics  and  industry  on  grounds  of  cost  alone. 

There  are  questions  of  fashion  however.  There  is  still  some  cachet  in  using  SUN 
workstations.  My  own  pedestrian  approach  is  to  use  rretworked  Apple  Quadras  running  Unix, 
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X-Windows  and  Mac  environments(and  even  DOS!)  concurrently,  which  at  lower  cost  and 
stxne  performance  penalties  allows  everyday  word-processing,  graphics,  imvie  making,  e-mail 
and  so  on  to  be  integrated  with  the  normal  workstation  role.  The  catch  of  course  is  that  the 
system  is  linked  to  a  large  transputer  array  for  compute-intensive  tasks.  This  example  raises  the 
issues  of  hardware  and  operating  system  standards.  My  own  solution  is  partly  cost  based 
(more  for  less)  and  partly  with  an  eye  on  upgrade  routes  and  inter-woridng  with  IBM  systems, 
UNIX,  DOS  and  MAC.  It  also  raises  the  issue  of  the  human  interface:  should  we  be 
developing  specialist  code  or  code  that  is  easily  used  within  familiar  environments  by  non¬ 
specialists?  If  the  answer  is  yes  we  have  to  confront  the  problem  of  huge  software  design  costs 
and  that  extra  memory  and  power  starts  to  lode  less  effective. 

What  power  do  we  really  need?  As  we  shall  discuss  later  some  problems  require  GFLOP  -> 
TFLOP  power  and  >  GBytes  memory.  How  do  we  get  it?  National  super  computers  or  local 
solutions? 

V.  Machine  power  II:  vectorisation  and  parallelisation 

Even  with  the  projected  advances  in  integrated  circuits  it  is  unlikely  that  desktop  single 
processor  GFLOP  power  is  imminent.  Parallel  processing  in  the  widest  sense  of  the  word  is 
usually  advocated  as  the  way  ahead  and  of  course  forms  the  basis  of  many  super  computer 
approaches.  Vectorisation  is  the  least  flexible  in  one  sense  and  so  is  easier  to  use,  full-scale 
parallelisation,  for  example  in  u  software  configurable  transputer  array  is  often  highly  efficient 
in  run-time  but  may  be  costly  in  development  time.  If  the  new  found  universality  offered  to 
computationalists  by  the  workstation  revolution  is  not  to  be  over-turned  it  is  essential  that  some 
order  and  standardisation  is  brought  into  parallelised  codes  for  general  and  local  use.  Our  own 
approach  at  Glasgow  is  to  cut  development  tinK  and  improve  portability  by  re-generating  all 
our  codes  in  common  parallelisable  format,  for  which  we  have  chosen  finite-element  algorithms 
as  the  main  format  which  can  cover  classical  device  modelling  through  to  many  electron 
quantum  transport  These  codes  are  designed  to  run  on  arbitrary  numbers  of  transputers  and  so 
can  be  developed  in  simple  workstations  before  transfer  to  the  main  cluster.  It  appears  at 
present  that  the  main  lesson  is  to  use  the  woikstation  as  the  main  hardware  tool  but 
supplemented  by  networking  to  compute  intensive  local  parallel  machines  of  intermediate 
power. 

These  arguments  for  the  workstation  approach  pale  into  insignificance  when  we  consider  the 
oppOTtunides  that  workstations  now  offer  for  visualisation. 

6.  Visualisation 
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The  chemistry  community  was  quick  to  develop  visualisation  tools,  many  of  which  are  now 
available  in  the  public  domain  for  quite  ordinary  workstations.  It  is  thus  possible  to  put  on 
appropriate  goggles(plus  IR  sensor)  and  guide  the  viewer  through  a  colour  stereo  journey 
through  a  molecular  structure  on  a  Silicon  Graphics  workstation.  Virtual  reality  systems  have 
been  pioneered  by  the  computational  chemists.  In  a  sense  this  is  obvious,  the  complexity  of 
molecular  design  is  in  direct  space:  it  is  thiee-dimensionally  geometrical  and  topological..  In 
device  modelling  the  effective  parameter  spaces  are  less  physically  obvious  and  often  high 
dimensional.  We  have  yet  to  learn  how  best  to  represent  dynamic  data  for  real  devices.  The  3D 
representation  of  electron  and  hole  trajectories  colour  coded  for  temperature  is  already  good 
practice  and  drawing  a  leaf  from  quantum  chemistry  the  use  of  3D  transparent  (cloud 
representation)  colour  contours  for  potentials  of  temperanue  surfaces  should  be  manageable. 
This  requires  24  bit  colour  high  resolution  workstations  and  of  course  soaks  up  memory  and 
CPU  time  (graphics  accelerators  of  course  are  a  way  out).  At  Glasgow  we  hpve  been 
developing  movie  and  graphics  techniques  to  help  us  gain  intuition  into  the  transient  few 
electron  problem  in  quantum  waveguides  and  single  electronic  systems.  Like  the  mediaeval 
schoolmen  there  is  nothing  to  be  gained  by  not  trying  things  out.  I  believe  that  visualisation  is 
crucial  to  the  development  of  computational  electronics  especially  if  the  aim  is  to  provide  access 
to  general  designers  and  experimenters.  Multiple  displays  with  friendly  interfaces  will  be 
essential.  This  route  should  not  be  left  to  chance  it  should  be  organised  and  draw  on  the 
experience  of  other  communities.  In  particular  the  input/output  of  visualisation  should  be  linked 
to  other  software  such  as  CAD  codes  where  possible. 

VII.  Software 

The  demands  of  visualisation,  operating  systems,  portability,  parallelisation  bring  us  to  the 
prospects  for  the  software  side  of  the  business.  It  also  raises  the  issue  of  what  level  we  should 
develop  our  computational  models.  First  some  generalities.  It  is  becoming  obvious  that 
computational  electronics  involves  large  codes  which  are  both  continuously  developing  and 
meeting  new  demands  from  interface  and  visualisation  requirements.  Although  there  is  huge 
investment  in  it,  FORTRAN  is  not  the  best  medium  for  managing  and  verifying  large  quickly 
developable,  de-buggable  codes  (my  own  bias  was  already  admitted  earlier  - 1  was  brought  up 
on  ALGOL).  C,  C*"*",  PASCAL  all  have  their  merits  especially  in  object  oriented  forms.  The 
answer  is  not  to  choose  the  best  (I  actually  am  attracted  to  functional  programming  languages) 
but  to  choose  the  best  for  the  community. 

On  visualisation  should  we  be  developing  our  own  codes  or  providing  hooks  into  public 
domain  or  commercial  software?  I  would  argue  for  both  because  today’s  best  visualisation 
software  has  evolved  from  earlier  scientific  studies. 
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One  of  the  most  interesting  developments  has  been  the  spread  of  software  such  as  MATHCAD 
and  MATHEMATICA  which  provide  highly  simplified  frameworks  for  quickly  solving  what 
used  to  be  formidable  problems  (with  excellent  graphics  thrown  in).  The  use  of  symbolic 
techniques  and  functional  programming  make  these  approaches  very  attractive  for  certain 
problems  (for  exaiiq>le  siiiq}le  ID  tunnelling  simulations).  But  they  have  little  efficiency  and  it 
is  often  difficult  to  program  them  for  efficiency.  Nevertheless  these  approaches  do  provide 
universality. 

VIII.  Organised  modelling 

Like  synchronised  swimming  I  believe  that  organised  modelling  will  eventual  appear  sensible 
and  exciting.  We  need  to  learn  from  communities  like  the  chemists  that  there  is  mutual 
advantage  in  sharing,  documenting  and  jointly  developing  community  codes  for  device 
DKxlelling.  There  are  too  many  stand-alone  operations  to  bring  the  promise  of  universal  access 
to  tiKxlelling  closer. 

There  is  a  great  deal  of  in-house  expertise  which  is  fiercely  guarded  (not  always  with 
justification).  I  would  like  to  see  this  type  f  woricshop  develop  into  a  forum  that  aims  to  set 
standards,  goals,  achieve  inter-woridng  and  sharing  of  knowledge  so  that  full  attention  can  be 
paid  to  triplications  driven  research.  Paradoxically  some  organisation  is  necessary  because  of 
the  problems  of  diversification  which  come  from  the  freedom  brought  about  by  the  workstation 
revolution.  How  this  should  be  achieved  is  a  subject  for  debate. 

IX.  New  opportunities 

Finally,  let  us  turn  to  a  selection  of  the  new  opportunities  for  the  core  of  computational 
electronics  -  the  models,  applications  and  tools  themselves.  Many  of  these  are  represented  at 
this  workshop,  some  are  steeped  in  antiquity,  others  are  striving  for  birth. 

In  classical/semi-classical  models  die  challenges  are  in  TeraHertz  high  frequency  device 
modelling,  the  inclusion  of  thermal  effects,  the  incorporation  of  process  models.  Despite  the 
conqilexity  it  is  becoming  possible  to  devise  physical  models  which  quantitatively  describe  real 
devices.  These  will  be  essential  as  integrated  circuit  technology  heads  into  the  fully  sub-micron 
regime.  The  advent  of  surface  probe  technology  (STM,  AFM  etc.)  and  on-chip  electrometry 
and  capacitance  techniques  will  t^n  up  the  hitherto  problematic  surface  and  interface  charging 
proUems  which  will  underpin  the  model  development  There  are  considerable  opportunities  for 
naodelling  which  is  designed  to  interpret  the  new  atomic  scale  probes. 

The  same  on-chip  techniques  have  recently  allowed  direct  observation  of  the  ubiquitos 
fluctuation  potential  which  bedevils  quantum  devices  and  the  direct  observation  of  many 
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electron  states  and  their  filling  in  quantum  dot  structures.  The  application  to  ultra-small 
commercially  miented  devices  will  be  challenging  and  will  require  active  modelling  went..  The 
aim  is  to  understand  and  control  the  intra-  and  extra-device  environment  in  detail. 

The  shift  to  ultra-uiudl  devices  and  devices  with  ultra-small  capacitative  regions  brings  up  the 
issue  of  quantum  charging  effects;  Coulomb  blockade,  correlated  tunelling,  macroscopic 
quantum  tunneling,  telegraph  and  shot  noise  effects  associated  with  the  discreteness  of  the 
electronic  charge.  The  modelling  required  here  and  for  the  now  extensive  range  of  single 
electronic  systems  is  non-triviai..  Single  electronics  is  scaleable  to  high  temperatures  by 
fabricating  sufficiently  small  capacitative  tunnel  junctions.  It  is  already  integratable.  Technique 
from  the  charge  density  matrix  ,  the  quantum  Langevin  equation,  Monte  Carlo,  Traffic  theory 
and  linear  programming  are  all  being  brought  to  bear.  The  single  electron  problem  actually 
turns  out  to  be  a  many  body  problem  and  much  remains  to  be  discovered.  Device-device 
coupling  is  an  unavoidable  reality  in  such  systems  and  extended  systems  of  devices  need  to  be 
modelled.  Control  of  killer  efects  such  as  quantum  fluctuations,  charge-trapping  de-trapping 
and  cross-talk  can  in  principle  be  handled  by  designing  the  substrate  electrostatic  environment 
correctly:  3D  modelling  on  heterostmetres  is  essential  here. 

There  is  now  enormous  scope  for  quantitative  quantum  modelling  but  we  should  be  careful  to 
distinguish  the  physically  interesting  from  the  engineering  possibilities.  To  restore  some 
balance  I  should  remark  that  my  own  interests  prtly  involve  studying  quantum  device  systems 
fabricated  within  two-dimensional  electron  gases  which  can  act  as  atomic  scale  instrumentation 
for  exploring  hithto  intractable  regimes  of  quantum  physics  such  as  delayed-choice,  empty 
wave  and  quantum  measurement  problems.  In  this  context  we  have  been  recently  successful  in 
modifying  the  Bohm  pilot  wave  version  of  quantum  mechanics  in  setting  up  and  solving 
problems  in  quantum  transport  theory. 

If  we  pursue  the  limit  of  miniaturisation  of  devices  to  its  logical  conclusion  we  encounter 
structures  on  molecular  scales.  The  problems  of  bioelectronics,  biosensors,  molecular  sensors, 
molecular  electronics  and  nanoelectrochemistry  all  converge  with  device  electronics  at 
nanometre  scales.  Even  the  techniques  in  quantum  chemistry  for  studying  the  STM  tunnel 
curent  and  energy  spectrum  of  a  small  molecule  bound  to  a  surface  bear  striking  resemblance  to 
quantum  dot  modelling.  The  long  range  tunnelling  of  electrons  through  polymer  chains,  redox 
centre  arrays  and  enymes  is  closely  related  to  the  electronic  soliton  dynamics  of  single 
electronic  systems. 

The  future  then  is  rich  in  promise,  quantitative  models,  evolving  interdisciplinarity  and  a  chance 
to  develop  a  strong  mature  modelling  community  properly  integrated  with  industry.  1  look 
forward  to  a  new  golden  age. 
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Abstmet  This  paper  focuses  on  the  discretisation  of  the  hydrodynamic  transport  equations  for 
electrons  in  sub-micron  MOSFET  structures.  In  particular  initialisation  of  the  electron 
temperature  equation,  and  the  important  problem  of  accurately  calculating  the  energy  input 
term  (J  E)  on  a  generalised  triangular  FE  mesh  will  be  considered.  The  discretisation  is 
applied  to  the  calculation  of  substrate  leakage  currents  in  sub-micron  LDD  MOSFETs,  the 
results  of  which  will  be  presented. 

I.  Introduction 

As  the  size  of  active  devices  in  VLSI  sub-systems  is  continually  reduced  the  validity  of 
the  standard  drift-diffusion  equations,  as  commonly  ^lied  to  nxxlel  semiconductor  devices,  is 
brought  into  question.  The  baric  assun^>tion  made  in  the  derivation  of  the  drift-diffusion 
framework  is  that  the  charged  carriers  in  a  device  are  in  dtermal  equilibrium  with  the  lattice 
phonons  at  some  ambient  ten^raturc. 

The  kinetic  Boltzmann  transport  equation  is  the  starting  point  for  describing  a  many- 
particle  cairier-phonon  system,  however,  direct  solution  of  this  equation  is  a  massive 
conrq)utational  task  and  is  therefore  generally  not  attempted.  A  more  practical  approach  is  to  use 
the  first-five  moments  of  the  Boltzmann  transport  equation  to  provi^  an  approximate  solution 
which  neglects  the  higher-order  moments.  These  moment  equations  are  commonly  referred  to  as 
the  hydrodynamic  equations,  which  are  basically  conservation  of  charge,  conservation  of 
momentum  (one  in  each  of  the  three  spatial  dimensions)  and  the  conservation  of  energy.  This  is 
an  acceptable  ^roximation  provided  thoe  is  sufficient  randomisation  of  the  thermal  energy  in 
the  system  to  allow  the  meaningful  use  of  average  quantities  such  as  velocity  and  effective  carrier 
temperature.  Under  these  conditions,  the  hydrodynamic  equations,  when  used  in  the  relaxation 
time  limit,  provide  a  practical  oigineering  platform,  which  can  be  used  to  investigate  hot-carrier 
effects  such  as  velocity  saturation  and  velocity  ovo^hoot 


This  pqier  describes  the  discretisation  of  these  equations  using  the  Control  Region 
Approximation.  This  method  is  general  and  is  not  restricted  to  a  particular  shape  of  elonent, 
however,  we  will  assume  the  use  of  triangidar  elements.  This  has  particular  importance  in  the 
definition  of  the  energy  input  into  the  energy  moment  equation.  This  will  be  discussed  more  fully 
in  section  6. 


109 


H.  Physical  &|ijahen  System 


Aj^dication  of  the  moment  method  to  the  Boltzmann  Tnuisport  Equation  produces  a  series  ai 
equations,  one  for  each  moment  taken.  The  resulting  steady  state  elec^n  energy  moment 
equation  can  be  written  as 


VS,-EJ,  +  «w.  +  n-^52»— ^^  =  0  (1) 

^  M 

here  S,  is  the  energy  flux  vector,  the  second  tenn  describes  the  energy  input  into  the  system,  and 
the  last  two  terms  describe  energy  loss  in  the  system  tte  first  due  to  carriers  recombining  and  the 
last  due  to  inelastic  collisions  in  the  device.  The  average  electron  energy  is  described  by  w, ,  ^  is 
the  recombination  rate  and  x  ^  is  the  energy  relaxation  time.  The  energy  flux  term  is  described 
the  constitutive  relationship 


S.=Q.-^K  +  kT^)  (2) 

here  Q,  is  a  heat  flux  term  which  is  identically  zero  under  the  symmetric  Maxwellian  assumption, 
however  it  is  included  heuristically  here  to  account  for  the  energy  flow  due  to  carrier  thermal 
gradients. 

The  conservation  of  carriers  is  described  by  the  steady  state  continuity  equations  for  electrons  and 
holes 


VJ,-q/f  =  0 

(3) 

V.J,+<7/?  =  0 

(4) 

which  again  require  the  constitutive  relationships  for  flux  in  titis  case  current  densities.  For 
electrons  the  appropriate  moment  equation  along  with  the  assumption  that  the  momentum 
relaxation  time  is  very  short  compare  to  the  energy  relaxation  time  lea^  to 


J.  =  +  p.knVTg  +  ti.kTjVn  (5) 

where  p,  is  die  electron  mobility,  n  is  the  electron  cotK^ntration,  y  is  the  electrostatic  potential,  k 
is  the  Boltzmann  constant  and  is  the  effective  electron  temperature.  Hole  transport  is  assumed 
to  be  described  by  the  standard  Drift-Diffusion  fiamework. 

J,  =  +  p/r^Vp  (6) 

where  Ti,  is  the  ambient  lattice  tenqierature.  Finally,  Poisstm's  equation  is  solved  for  electrostatic 
potratial 


V.D-p  =  0 


(7) 
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where  D  is  the  electric  flux  vector  given  by 


D  =  eE  =  -€Vy  (8) 

e  is  the  dielectric  constant  for  the  nuterial.  Equations  (1),  (3),  (4)  and  (7)  along  with  the 
constitutive  relationships  (2),  (5),  (6)  and  (8)  form  a  closed  set  of  equations  in  the  variables  T^,  n, 
p  and  y .  Which  can  be  discretised  using  the  control  region  ^>proxiniation. 


M  Physical  Models 
Mobility  model 

At  elevated  electron  temperatures  carrier  mobility  is  reduced  by  increased  scattering  rates  this 
causes  the  carrier  velocity  to  saturate.  The  electron  temperature  can  now  be  used  as  a  parameter 
in  the  electron  mobility  model.  In  effect  it  plays  the  same  role  as  the  electric  field  does  in  the 
standard  drift-diffusion  context.  In  this  work  an  electron  temperature  dependent  model  is  used 
which  is  consistent  with  the  conventional  field  dependent  model  for  mobility  under  homogeneous 
conditions,  which  has  the  form 


1  +  X 

k/iTg  kgTf^  ] 

-  <7  JJ 

(9) 


is  the  low-field  mobility  and  Xis  related  to  the  election  saturation  velocity  v,  and  the  energy 
relaxation  time  x  ^  by 


1-ik 


(10) 


Impact  Ionisation  Model 


In  hydrodynamic  simulations,  non-local  effects  on  carrier  transport  are  of  primary  concern. 
Energy  consideration  in  carrier  transport  plays  a  crucial  role  at  this  level  of  simulation. 
Accordingly,  the  suitability  of  using  local  field-dependent  impact  ionisation  iiKxlels  in 
hydrodynamic  simulations  is  not  at  all  obvious.  To  alleviate  firis  problem,  the  electric  field 
dependence  of  the  well-known  Chynowedi's  enyiiical  formula  for  impact  ionisation  coefficient 
a.is  replaced  by  an  appropriate  electron  temperature  dependence.  This  requires  the  establishment 
of  a  relationship  between  the  electric  field  and  electron  tempnature.  For  the  homogeneous  case, 
the  energy  equation  (1)  together  with  the  mobility  model  (9)  can  be  used  to  obtain  such  a 
relationship 


IkOkzZI 

.2 


(11) 


Using  the  Chenoweth  formula  for  a. 
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(12) 


where  a  and  b  are  eni|arical  coefficients,  one  can  model  the  non-local  effects  of  im{)act  ionisation 
through  the  non-local  solution  of  electron  ten^ierature. 

IV  Control  Roglon  Approximation 

The  partial  differential  equations  that  make  up  the  device  nxxlel  are  all  in  the  divergence  form, 
that  is  diey  have  the  divergence  of  some  flux  vector  quantity  and  a  number  of  source  terms.  In 
general  then  they  can  be  written 


VF-5(r)  =  0 


(13) 


Over  some  local  region  of  the  device  O  performing  a  surface  integral  gives 

Jj  VFdO-JJ  S(r)dn  =  0 

Q  Q 

^  (14) 

^n-FdT-JI  S(r)dC2  =  0 
r  Q 

where  n  is  the  outward  normal  unit  vector  and  F  is  tire  boundary  of  the  region.  This  is  now  in  a 
form  suitable  for  discretisation.  Consider  a  typical  node  i  in  a  triangular  mesh  as  shown  in  Figure 
1. 


Figure  1.  A  control  region  £1  (shaded  area),  of  area  A^,  surrounding  node  i 

The  region  of  integration  O  becomes  the  Voronoi  region  (shaded)  and  the  boundary  F  becomes 
tire  boundary  of  the  Voronoi  region.  The  line  integral  in  (9)  can  now  be  q^nroxinutied  by  a  piece- 
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wise  summation  of  the  flux  across  each  of  the  edge  segments.  The  integration  of  the  source  t»rm 
is  apimiximated  by  assuming  a  it  to  be  constant  at  the  value  found  at  the  actual  node,  thus 


(15) 

/I,  is  the  Vwonoi  area  associated  with  node  i,  M,  is  the  number  of  edge  connected  to  the  node, 
and  d^  is  the  pipe  width  connecting  the  nodes  along  line  ij.  So  providing  we  can  produce  a 

suitable  method  of  discretising  the  flux  vecttvs  along  an  edge  we  can  {q>ply  (1 1)  to  all  nodes  in  the 
mesh,  the  resulting  set  of  non-Unear  equations  can  be  solved  using  the  Newton-Riqihson  mefliod. 


V  Discrttisad  form  of  tho  Semiconductor  Equations 

The  set  of  divergence  equations  to  solve  are 


f  +  +  A  =0 

(16) 

i-1  V  Ji 

(17) 

Mi 

(18) 

ial 

Mi 

(19) 

eel 


The  constituent  relationships  are  readily  discretised,  for  the  first  three  equations  the  nxxlified 
Schaifetter-Gummel  methc^  is  used,  and  for  the  electric  flux  the  standard  finite-difference 
expression  is  used.  The  equations  are  assembled  using  the  standard  finite  element  ^)proach  on  an 
element  by  element  basis.  The  linear  equations  are  solved  using  the  Bi-CXiSTA6  metiiod  [1], 


Vi  Discretisation  of  the  Energy  Source  Term  (E  •  J.) 

This  term  inovides  energy  input  into  the  electron  oisemble,  and  its  proper  discretisation  is 
therefme  essential.  The  difficulty  in  discretising  tius  term  arises  from  the  inconsistencies  of  edge 
currents  in  a  element  When  the  electric  fidd  (E)  is  to  be  determined,  a  unique  value  for  an 
elonent  is  readily  found  from  the  nodal  potentials.  This,  however,  diis  is  not  true  for  the  current 
density  field  (J,),  since  the  Scharfetter-Gummel  method  used  in  the  discretisation  will  only  yield 
consistent  edge  currents  when  the  current  flow  is  purely  one-dimmsionaL  In  order  to  resolve  tiiis 
problem  several  techniques  have  been  suggested  [23.4].  The  first  method  of  Laux  is  found  to  be 
extremely  unstable,  especially  in  the  drain  depletion  region,  where  there  are  rqnd  changes  in 
carrier  amcmtration  and  the  weighting  metiKxi  can  yield  a  distribution  of  heating  which  is  both 
erratic  and  unrealistic.  The  otiier  two  mediods  use  conq>letely  (Uffierent  iqtproaches  to  the 
discretisation  of  the  (E  - J,)  term,  nevertheless,  the  discretised  forms  are  almost  identical  apart 
from  small  second  ordo*  differences.  These  terms  have  significant  effect  on  the  stability  of  the 


solution  procedure,  since  they  occur  in  off  diagonal  terms  of  the  system  Jacobian,  and  under  some 

conditions  destroy  the  diagonal  dominance  of  the  system.  This  means  that  unless  an  extremely 

small  mesh  spacing  is  used,  which  is  unphysical  and  computationally  expensive,  then  a  solution 

cannot  be  found  for  drain  bias  values  over  a  few  volts.  For  this  reason  the  power  scheme  [3]  is  ' 

preferred,  as  it  does  not  {Hesent  any  apparent  convergence  problems. 

VII  Solution  algorithm 

i 

Figure  2  shows  a  flow  diagram  of  the  solution  algorithm.  Initially  a  fully  coupled  drift-diffusion 
solution  is  sought  which  is  then  used  to  provide  an  initial  guess  for  electron  temperature.  It  also 
provides  a  good  guess  for  the  other  solution  variables.  A  decoupled  scheme  is  then  used  to  obtain 
a  self  consistent  solution  of  the  hydrodynamic  equations.  Typically  the  number  of  iterations 
required  for  self-consistency  increase  with  an  increase  in  electron  temperature.  4 


VHIRmuHs 


The  results  of  the  hydrodynamic  simulation  are  demonstrated  for  a  LDD  device  structure,  and 
validated  against  expeiimoital  data  on  a  range  of  devices  of  diflering  gate  lengths.  Figure  3 
shows  the  election  concentration  for  a  5^m  long  channel  device  and  the  associated  electron 
temperature.  As  expected  the  electron  temperature  is  hi^st  at  the  drain  depletion  edge,  there  is 
also  a  smaller  amount  of  heating  in  tire  channel  depletion  region.  Once  the  temperature 
distribution  is  established  at  a  particular  bias  point  the  local  impact  ionisation  rate  can  be 
calculated  bas  on  the  expression  ^ven  in  equadons  (11)  and  (12),  from  which  the  substrate 
cuirent  can  be  evaluated  by  summation  across  die  device  and  weighting  by  the  Voronoi  area 

/rt.=  (20) 


where  V,  is  the  saturation  velocity.  Figure  4  shows  the  terminal  currents  for  a  0.8p.m  length 
device,  compared  with  experimental  results  for  the  same  structure.  In  order  to  get  such  a  good  fit 
an  energy  relaxation  time  of  0.02ps  has  been  used,  and  a  saturation  velocity  of  9xl0‘  cm/s  has 
been  used.  The  agreement  with  these  same  parameters  is  equally  as  good  for  shorter  gate  lengths. 

iX  Discussion 

In  his  prqier  we  have  presented  a  self-consistent  model  capable  of  solving  the  hydrodynamic 
equations  in  semiconductor  devices.  The  equations  have  been  discretised  on  a  general  mesh  using 
the  control  region  approximation,  careful  consideration  has  been  paid  to  the  discretisation  of  the 
heat  source  term. 

Results  of  the  method  have  been  demonstrated  for  an  LDD  MOSFET  structure  and  found  to  give 
excellent  agreement  with  measure  drain  and  substrate  currents. 
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(a)  electron  distribution  VDS  *  3.0V,  VGS  ^3JV 

(b)  electron  temperature  VDS  *  3.0V.  VGS  *  2  JV 
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Abstract 

In  this  work,  we  present  a  general  2*dimensional  spherical  harmonic  formulation  of  Boltz¬ 
mann’s  transport  equation.  Until  recently,  numerical  implementation  of  this  approach 
has  been  discussed  for  either  1 -dimensional  geometries,  or  only  a  few  of  the  spherical  har¬ 
monics  [1,2].  In  this  paper,  a  formulation  is  presented  that  includes  an  arbitrary  number 
of  spherical  harmonics. 

I.  Introduction 

Device  modeling  by  direct  solution  to  the  Boltzmann  transport  equation  is  usually  not 
performed  because  of  dimensionality  problems  and  difficulties  in  evaluating  the  collision 
integral.  To  directly  account  for  2-dimensional  device  operation  with  the  Boltzmann  equa¬ 
tion,  we  would  normally  have  to  perform  calculations  in  5  dimensions  (2  dimensions  in 
real-space  and  3  dimensions  in  momentum-space).  To  overc-ome  this  ‘curse  of  dimension¬ 
ality’  a  new  approach  to  solve  the  BTE  in  device  models  is  being  developed  which  uses  a 
spherical  harmonic  (SH)  or  a  Legendre  polynomial  expansion. 

II.  Indefinite  Spherical  Harmonics  Expansion 

One  of  the  reasons  why  we  are  interested  in  this  method  is  that  it  gives  us  the  ability 
to  produce  differential-difference  operators  for  the  evaluation  of  the  collision  integral  to 
all  orders  in  the  expansion.  Finding  expressions  for  the  rest  of  the  BTE  operators,  how¬ 
ever,  involves  some  work.  To  solve  the  2-D  BTE  to  higb-order  accuracy,  the  distribution 
function  is  expressed  in  terms  of  an  infinite  spherical  harmonic  expansion  with  unknown 
coefficients  that  depend  on  energy  wd  position: 

/(r,k)  =  j:r(r.t)»r(«.«  (1) 

/m 

Yi^{0,  ift)  are  the  spherical  harmonics[3];  are  the  coefficients  which  are 

to  be  determined;  f  =  0, 1, 2, . ;  and  for  each  I,  the  superscript  m  =  — /,  —/-|- 

. /-I,/. 

The  spherical  harmonics  give  the  angular  dependence  of  the  distribution  function  in  mo¬ 
mentum  space,  and  the  coefficients  provide  its  magnitude.  The  SH-numeric^  formulation 
allows  us  to  account  for  the  angular  dependence  of  the  distribution  function  in  momentum 
space  (9,  <f>)  analytically,  thereby  reducing  the  dimensionality  of  our  calculations  from  5 
to  3. 
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in.  SH  Formulation  to  Arbitrarily  High  Order  Accuracy 

The  objective  is  now  to  determine  the  unknown  coefficients  /”(r,£),  which  can  be  used  to 
construct  the  distribution  function.  Furthermore,  to  minimize  the  possibility  of  truncation 
errors  in  the  SH  approach,  we  have  developed  a  technique  to  determine  the  coefficients  to 
arbitrarily  high  order.  The  basic  idea  behind  the  approach  is  to  automatically  generate 
a  system  of  equations  for  all  the  unknown  SH  coefficients  //"(r,  e),  and  then  solve  the 
system  and  construct  the  distribution  function  using  the  above  summation. 

To  generate  this  set  of  equations,  we  first  substitute  the  above  summation  into  the  BTC. 
Next,  we  project  the  BTE  onto  each  of  the  SH  basis  functions.  The  projection  onto  the 
l,m'th  SH  basis  function,  which  yields  an  equation  for  the  l^m'tk  coefficient,  is  illustrated 
by  the  following  operation. 

By  performing  a  similar  projection  onto  each  of  the  SH  basis  functions,  the  angular 
dependence  of  the  distribution  function  is  integrated  out,  and  an  infinite  system  of  coupled 
equations  is  generated  for  the  unknown  coefficients. 

In  principle,  this  set  of  projections  could  be  performed  as  is,  leaving  a  system  of  differential- 
difference-integral  equations  for  the  unknown  coefficients  However,  the  initial 

substitution  of  the  SH  expansion  into  the  various  terms  of  the  BTE  gives  rise  to  many 
nonlinear  products  of  SH  basis  functions.  Projecting  these  nonlinear  products  by  per¬ 
forming  the  indicated  integrations  would  become  unwieldy.  Furthermore,  since  an  infinite 
SH  expansion  is  present,  each  equation  would  contain  an  infinite  number  of  terms,  and 
each  equation  would  therefore  be  directly  coupled  to  all  the  other  generated  equations. 

To  simplify  the  system,  we  take  advantage  of  the  SH  recurrence  relations[3].  These  re¬ 
lations  allow  us  to  re-express  all  nonlinear  products  of  SH  basis  functions  in  terms  of 
linear  combinations.  Once  each  term  in  the  BTE  is  expressed  as  a  linear  combination, 
we  can  take  advantage  of  the  orthogonality  of  sphericad  harmonics  and  easily  perform  the 
projections  indicated  by  Eqn.  (2). 

IV.  Generalized  System  of  SH  Equations 

After  using  recursion  and  performing  the  indicated  integrations,  we  obtain  the  remarkable 
result  that  almost  all  of  the  infinite  terms  in  each  equation  vanish  identically  due  to 
orthogonality.  Furthermore,  we  find  that  the  coupling  between  equations  is  only  through 
neighbors.  Another  extremely  useful  result  is  that  each  equation  has  an  identical  form. 
The  system  can  therefore  be  automatically  generated  to  arbitrarily  high  order  and  then  be 
solved  numerically.  The  equation  for  the  /,m  SH  coefficient  is  given  below.  The  equation 
for  any  of  the  other  SH  coefficients  is  obtsdned  by  appropriately  changing  the  value  of  the 
indices  /,  m  to  other  allowed  integers: 
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where  v{£)  =  7'  =  d-y/de,  and  7  represents  the  dispersion  relation;  the  sum  is  over 

the  2  directions  in  the  x  —  z  plane;  and  the  operators  a  have  been  defined  as, 


K/r  =  ^  (4) 

=  <>r-r'»rr.  (5) 

(6) 


where  aj"  =  Due  to  the  need  to  consider  self-consistent  boundary  conditions 

in  multi-dimensional  problems,  it  is  necessary  to  produce  a  set  of  2nd  order  equations 
from  this  set  of  1st  order  equations.  Before  doing  this,  however,  it  is  possible  to  eliminate 
mixed  partial  derivative  operators  such  as  that  would  occur  from  such  a  substitution 
procedure  by  introducing  the  guage  transformation, 

£-»£'-  e<f>{x)  =  H  (7) 

as  discussed  in  Ref.  [2].  Here,  the  effect  of  the  transformation  is  to  produce  the  same  set 
of  equations  as  Ekjn.  (2)  except  that  now,  the  energy  derivatives  are  no  longer  present. 
This  is  because  the  derivatives  with  respect  to  position  now  have  a  new  meaning,  i.e.  they 
are  evaluated  for  fixed  values  of  H. 

V.  Results 

We  have  solved  this  generalized  system  for  the  2-D  space-independent  case.  This  system 
is  truncated,  discretized  and  solved  numerically.  Solving  the  space  independent  BTE  to 
10  orders  requires  less  than  a  minute  of  CPU  time  on  a  Sun4  work-station.  Fig.  1  shows 
the  SH  coefficients  coresponding  to  /  =  0, 2, 4  for  a  100^  electric  field  45  degrees  from  the 
P3  axis  in  the  111  direction.  Fig.  2  shows  a  comparison  of  this  result  with  the  case  of  the 
electric  field  in  the  direction  of  the  pa  axis.  The  purpose  is  to  demonstrate  how  different 
SH  coefficients  become  important  to  resolve  the  2-dimensional  angular  dependence  of  the 
distribution  function.  Calculation  of  the  isotropic  coefficient  remains  the  same,  while  the 
magnitude  of  higher  order  coefficients  change  in  response  to  the  changing  values  of  the 
spherical  harmonics.  It  is  believed  that  some  numerical  error  is  present  at  low  energy  for 
the  higher  order  coefficients  as  it  was  expected  that  the  shapes  of  the  coefficients  of  the 
same  order  in  /  should  be  the  same. 
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1.  SH  Coefficients  corresponding  to  1“0,2,4  for  s  100  kV/ca 
c  field  In  the  111  direction. 


Fig.  2.  A  co^arlson  of  SH  coefficients  cslculated  for  two  different 
field  orientations. 


VI.  Summary 

We  have  developed  a  new  method  for  solving  the  BTE  for  2-dimensional  geometries. 
The  method  reduces  the  dimensionality  of  the  problem  from  5  to  3.  Theoretically,  the 
orientation  of  the  coordinate  system  should  not  affect  the  calculation  of  the  distribution 
function,  however  in  practice  with  the  direct  method,  a  small  numerical  difference  is 
noticed. 
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Fig.  3.  Isotropic  coefficient  /o  for  different  homogeneous  applied  electric 
fields.  Comparison  of  the  LP  calculation  with  the  Monte  Carlo  method  was 
excellent  up 'to  energies  higher  than  2eV. 
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Comparison  of  Cellular  Automata  and  MINIMOS 
simulations  of  submicron  MOSFETs 


A.  Ran,  G.  Zandler  and  P.  Lugli 

Physik  Department  and  Walter  Schottky  Institut,  TU  Munchen,  D-85747  Garching,  FRG 

Abstract 

We  present  a  detailed  comparison  of  a  novel  cellular  autCMnaton  (CA)  technique  and  a  standard 
drift  diffusion  calculation  (ME^IIMOS)  of  high  field  transport  in  semiconductor  devices.  The 
CA  method  may  be  viewed  as  equivalent  to  the  Monte  Carlo  technique  but  can  easily  handle 
ensembles  with  more  than  10^  particles,  can  efficiently  (teal  with  complex  geometries  and 
achieve  accelerations  on  multiprocessor  computers  that  scale  linearly  with  the  number  of 
processors.  W^th  this  new  technique  Si  MOSFETs  have  been  simulated  for  different  gate 
lengths  and  gate  voltages  and  the  results  compared  to  MINIMOS. 

I.  Introduction 

Device  simulation  has  become  a  crucial  and  strategic  part  of  today’s  microelectronics  [1-3]. 
Drift  diffusion  approaches  [4]  are  commonly  used  for  device  mtxtelling  because  of  their 
intrinsic  speed  They  suffer  though  from  the  several  approximations  of  the  underlying  physical 
mo(tel.  The  Monte  Carlo  (MC)  method  [5,6]  belongs  on  the  contrary  to  the  most  sophisticated 
but  at  the  same  time  the  most  costly,  of  all  simulators.  For  this  reason,  the  MC  method  still 
remains  restricted  to  university  and  laboratory  research,  and  has  not  yet  become  a  common 
modeling  t(X>l.  It  would  therefore  be  highly  (tesirable  to  develop  a  simulator  of  comparable 
physical  content  as  the  MC,  but  much  faster  and  also  enable  of  exploiting  the  potential 
offered  by  vector  and  parallel  processors  more  naturally. 

Recendy,  a  new  method  has  been  developed  which  appears  to  meet  these  requisites,  the 
Cellular  Automata  (CA)  tqiproach  [7,8].  So  far,  <Mily  preliminary  tests  of  its  applicability  to 
device  modeling  have  been  carried  out.  In  this  paper,  a  critical  assessment  of  the  strengths 
and  limitations  of  the  CA  for  realistic  device  simulations  is  given  by  presenting  the  first 
quantitative  and  detailed  comparison  between  the  CA  and  a  standard  Drift-Diffusion  approach 
based  on  the  MINIMOS  code  [1]  for  a  Si  MOSFET. 

II.  The  Cellular  automaton  for  device  simulations 

Like  the  MC  metiKxi,  the  CA  is  a  physical  approach  to  carrier  transpem  in  semiconductors 
based  on  the  simulation  of  a  raiKlom  walk  of  classical  particles  subject  to  probability  scattering 
events  by  phonons,  impurities,  other  carriers,  whose  energy  (tepenitence  is  evaluated  from 
Fermi’s  Golden  rule  [5,6]. 

In  general,  the  electric  field  acting  on  the  particles  is  a  function  of  position;  in  emter  to  fully 
account  for  ncm-homogeneous  situations  (and  therefive  to  simulate  semiconductor  devices)  the 
MC  and  the  CA  simulations  have  to  be  self-consistentiy  coupled  to  Poisson’s  equation  [5,6]. 
Recently,  the  full  BE  for  carrier  transport  in  semiconductors  has  been  transfimned  to  a  CA 
[7].  This  cemstitutes  an  impertant  improvement,  since  CA  are  traditionally  only  used  in  the 
cemtext  of  transport,  for  fluid  dynamics  [9,10]  cn*  drift-diffusion  simulations  [11].  In  genoal, 
a  cellular  aut(nnaton  consists  of  a  lattice  with  a  finite  number  of  states  attached  to  each  lattice 
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site.  The  population  of  these  states  is  simultaneously  updated  according  to  deterministic  or 

nondeterministic  rules  in  discrete  tin^  steps.  The  dynamics  of  CA  are  gover^d  by  local 

rules.  i.e.  updating  site  variables  involves  only  a  small  number  of  nei^bors  in  each  time 

step.  For  this  reason,  CA  can  t^timally  utilize  massively  parallel  ctMnputer  technology.  In  < 

addition,  the  locality  of  the  dynamical  rules  allows  an  efficient  and  flexible  treatment  of 

complex  geometries.  The  major  characteristics  of  CA  are  the  two  length  scales  they  operate 

on.  The  first  described  by  the  discrete  miciowtvld  on  a  lattice  obeying  a  ficticious  dynamics 

of  pseudo-particles,  whose  length  and  time  scales  are  much  shorter  than  the  physical  scales. 

The  second  is  a  continuous  macroscale  with  the  physical  observables,  which  are  obtained  in  * 

practice  by  taking  averages  over  many  cells. 

In  its  implementation  for  the  solution  of  the  BE,  the  CA  consists  of  a  lattice  in  position 

space,  each  site  of  which  has  a  finite  number  of  momentum  states.  The  nondeterministic 

transition  rules  between  these  states  associated  to  collision  events  are  determined  from  the 

quantum  mechanical  scattering  rates  (in  the  same  way  as  in  MC)  and  from  the  classical 

equations  of  motion.  Due  to  the  locality  in  position  space  of  quantum  mechanical  scattering 

events,  which  is  a  basic  assumption  underlying  the  BE,  there  is  no  principle  problem  to 

convert  these  transitions  in  momentum  space  into  local  CA-rules.  On  the  other  hand,  the  drift 

and  diffusion  terms  in  the  BE  link  the  distribution  function  to  its  value  in  different  position  , 

and  momentum  space  locations,  being  therefore  nonlocal  in  natiue.  Additionally,  a  single 

semiclassical  particle  trajectory  cannot  in  general  be  reproduced  exactly  on  the  discretized 

phase  space  of  the  automata.  Therefore,  the  kinetic  terms  of  the  BE  are  replaced  by  hopping 

probabilities  in  both  position  and  momentum  space  in  such  a  way  that  the  equations  of  motion 

are  fulfilled  on  the  average  for  an  ensemble  of  pseudoparticles.  By  an  appropriate  choice  of  ( 

lattice  constant  a  and  timestep  dt,  the  hopping  events  can  be  restricted  to  transitions  between 

marest  neighbors  (or  second  nearest  neighbors).  This  choice  is  restricted  by  the  desired 

resolution  in  position  space  and  by  the  maximum  physically  relevant  velocity,  which  must  be 

less  than  aldt.  This  procedure  results  in  a  master  equation  for  the  state  occupancies  which 

omtains  only  on-site  transitions  between  different  momentum  states  and  transitions  between  i 

nearest  neighbors  with  the  same  momentum  state.  For  a  rigorous  derivation  and  a  detailed 

discussion  of  the  basic  algorithm,  see  [7]. 

We  have  estimated  the  number  of  operations  that  translate  into  MFLOPS  required  on  a  scalar 

processor  for  simplified  MC  and  CA  device  simulations  on  a  fixed  field  distribution  that 

includes  nonparabolic  bands  and  the  standard  scattering  mechanisms  for  GaAs.  We  find  that  * 

an  ensemble  MC  simulation  requires  at  least  lO^^A^  operations,  where  N  is  the  number  of 

particles  in  the  simulated  ensemble,  while  the  CA  requires  about  \0^*N  operations,  providing 

a  speed-up  of  two  orders  of  magnitudes  with  respect  to  MC.  The  principal  difference  is 

that  the  CA  maps  the  Boltzmarm  equation  onto  a  set  of  discrete  rate  equations  for  discrete 

variables  that  involve  only  integer  arithmetic.  In  momentum  space,  this  discrete  dynamics  * 

allows  one  to  use  predetermined  scattering  tables  both  for  the  total  scattering  rates  as  well  as 

for  the  free  flights.  In  the  CA,  the  final  k-state  is  picked  by  a  simple  assignment  rather  than 

by  the  algebraic  solution  of  nonlinear  equations  as  in  the  MC.  In  addition,  the  motion  in  real 

space  is  discrete  in  the  CA  and  consists  of  deterministic  hq)s  between  nearest  neighbor  cells.  , 

Thus,  the  teal  space  motion  is  automatically  synchronized  and  occurs  in  a  strictly  mdered  and 

predictable  fashion,  in  contrast  to  the  MC,  where  the  exact  continuous  trajectories  of  each 
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particle  have  to  be  fdlowed.  Clearly,  very  efficient  Poisstm  solvers  need  to  be  coupled  to  the 
CA  in  self-consistent  device  simulations.  Our  experience  shows  that  SOR  algorithms  are  too 
slow  and  can  be  the  bottleneck  of  a  CA  simulation,  particularly  for  large  deling  gradients. 

Well  known  alternatives  to  SOR  methods  are  also  FACR  and  FFT  algwithms  [12].  It  should  be  ( 

mentioned  that  the  initialization  phase  of  a  simulation,  on  the  other  hand,  requires  significantly 

more  effort  in  the  CA  method  since  all  scattering  tables  need  to  be  calculated.  Fortunately, 

it  needs  to  be  executed  only  once.  Modem  massively  parallel  computers  employing  either 

the  message  passing  or  shared  memory  paradigm  offer  a  substantial  reduction  in  turnaround 

time,  provided  efficient  algorithms  can  be  implemented  on  such  an  architecture.  Indeed,  * 

both  the  MC  and  the  CA  can  be  implemented  on  MIMD  (multiple  instruction  multiple  data) 

machines  by  assigning  subdomains  of  the  whole  device  to  the  individual  processors  [13].  Each 

process^  executes  one  time  step  of  the  simulation  on  its  subdomain,  collects  the  properties 

of  those  particles  that  will  leave  the  subdomain  at  the  end  of  this  time  step,  passes  this 

information  to  its  neighboring  processors,  and  receives  information  from  them  about  particles  * 

entering  its  subdomain  in  the  next  time  step.  Such  collection  of  information  about  the  particle 

exchange  with  the  neighbexing  processors  requires  more  computations  than  needed  on  a  single 

processor.  This  overhead  is  the  price  for  the  distribution  of  the  computational  load  emto  many 

processors.  We  have  calculated  the  ratio  of  this  overhead  for  the  CA  and  MC  thod.  Let  ^ 

Npf  be  the  number  of  particles  assigned  to  one  processor.  Within  a  MC  approach,  we  estimate 

that  it  takes  every  processor  10^*Npp  operations  to  collect  and  digest  the  information  about 

the  outgoing  and  incoming  particles.  In  the  CA,  this  overhead  is  independent  of  the  number 

of  particles  but  only  depends  tm  the  number  of  real  space  cells  per  subdomain  and  the 

number  of  discrete  k-states  per  cell  N^.  through  the  relation  10~'^Nky/N^.  With  typical  I 

values  for  device  simulations,  Npp  -10^,  Nnu  -  lO*,  -10^,  one  finds  that  the  time  fw  the 

message-related  computations  required  on  each  processor  is  two  orders  of  magnitude  smaller 

in  the  CA  than  in  MC. 

Hi.  CA  vs.  MINIMOS  simulation  of  Si-MOSFET  ^ 

In  order  to  test  the  capability  of  the  CA  to  handle  the  complex  structure  of  a  Si  n-MOSFET, 
a  detailed  comparison  with  a  standard  drift-diffusion  algorithm  (MINIMOS  [1])  has  been 
performed.  The  CA  simulation  employs  a  non-parabolic  band  structure  for  electrons  and  a 
parabolic  dispersion  for  one  effective  hole  band.  Furthermore,  optical  and  acoustic  phonon¬ 
scattering  as  well  as  impact  ionization  and  impurity-scattering  are  taken  into  account.  Source  ^ 

and  drain  contacts  are  0.24  /rm  long,  each  separated  by  10  nm  from  the  gate  contact.  An  oxide 
layer  of  S  nm  thickness  has  been  used  (Fig.  1).  The  doping  profile  is  the  one  calculated  by 
the  MINIMOS  pre-processor.  The  high  carrier  density  in  a  MOSFET  requires  large  particle 
ensembles  in  order  to  reduce  the  statistical  noise  and  we  have  used  300000  particles.  On 
several  scalar  RISC  workstations,  the  execution  time  for  the  CA  turned  out  to  be  10-15  times  4 

larger  than  MINIMOS. 

In  figure  2(a)  we  show  the  drain  characteristics  for  two  different  gate  voltages  (1.5  V  and 

2.5  V).  The  gate  length  (Lo=  0.25  /xm)  has  been  chosen  close  to  the  limit  where  MINIMOS 

simulations  can  still  be  expected  to  be  reliable.  The  drain  characteristics  agree  well  with  one 

another,  except  in  the  regime  of  voltages  above  2  V,  where  a  slight  velocity  overshoot  (which  ^ 

is  not  accounted  for  in  the  drift-diffusitm  approach)  acts  to  increase  the  drain  current. 

Hgure  2(b)  shows  the  I-V  characteristics  for  a  shorter  gate  length  of  0.16  ^m.  Short 


4 


125 


om 

IOM 


Qap 

10  IM 


L,-0.2S>iiw 


L,-0.24  (im 


p-tauffar 


0.75  |im 

FIG  1.  Ocomeiiy  of  •  0..2S  pm  MOSFET. 
In  x-dkcctian  the  device  is  subdivided  into 
126  and  in  y-direction  into  78  blodcs,  lespec- 
tively.  The  n**  doping  for  the  contact  region 
is  ltf*cm-*. 


channel  effects  become  more  important  in  this  regime, 
and  the  two  methods  give  significantly  different  drain 
characteristics.  This  originaies  almost  totally  from  a 
significant  velocity  overshoot  that  is  accounted  for  by 
the  CA  simulation  but  not  by  MINIMOS.  In  fact,  as 
shown  in  Hg.  3  (a),  the  electron  channel  density  of  the 
0.16  /im  MOSFET  is  very  similar  for  the  two  methods 
up  to  an  applied  voltage  of  4.0  V  and  a  gate  voltage 
of  1.5  V.  In  contrast,  the  drift  velocity  as  t^nained  in 
the  CA  shows  a  significant  overshoot  for  drain  voltages 
above  0.6  Volt,  in  contrast  to  the  MINIMOS  results 
(Fig.  3  (b)).  At  last  we  present  our  investigation 
about  impact  ionisation.  For  the  cellular  automaton 
we  used  a  microscopic  impact  ionisation  model  for 
high-field  energy  electron  transput  [14]  and  adapt  the 
homogeneous  itmisation  rate  to  experimental  data  [IS]. 


In  figure  4<a)  and  4(b)  position  resolved  distributions  of  impact  ionisation  events  are  given  for 
the  0.25  ftm  MOSFET  at  a  drain  voltage  Ud  =  4.0  V  and  gate  voltage  Uq  =  1-S  V  has  been 
used.  Figure  4(a)  illustrates  the  MINIMOS  simulation,  where  the  maximum  of  the  impact 
itmisation  is  located  between  gate  and  drain.  This  also  holds  for  the  distribution  of  impact 
ionisation  using  the  cellular  automaton  (Hg.  4(b)),  but  the  maximum  is  shifted  closer  to  the 
intersection  of  gate  and  drain.  The  overall  agreement  is  still  very  satisfactory. 


FIG  2.  (i)  Drain  current  vs.  drain  voltage  for  a  0.23  pm  Si  -  8M)SFET  calculaird  with  die  drift  diffusion  model  MINIMOS 
(triangles)  and  with  the  cellular  automaton  (full  line),  (b)  I>ain  cunertt  vs.  drain  voltage  for  a  0.16  pm  Si  -  MOSFET. 


FIG  3.  (a)  Vertically  mtegrated  sheet  density  of  electrons  in  the  0.16  pm  MOSFET  (  UD-4n  Mbit;  Ua«  1.3  Vilt).  The  foil 
line  repreaenii  the  CA  calculadon  and  triangles  results  of  MINIMOS.  (b)  Vertically  weighted  drift  velocity  under  the  gate 
(vm  indicatee  the  saturmion  velacity  of  bulk  silioon). 
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FIQ  4.  Cooiosr  pkM  of  impact  kmiuiion  me  per  votame  (10'*cm'*pc'‘]  for  a  0^  ittn  MOSFET  (U|>»4X)  V.  Ua-1.5V)  in 
the  laie  region  of  the  device.  The  blade  wide  line  along  the  x-axie  indicatee  the  gate  contact  (a)  MINIMOS  (b)  CA. 

IV.  Conduslon 

have  inesented  a  detailed  comparison  of  a  novel  cellular  automaton  (CA)  technique  and 
a  standard  drift  diffusion  calculation  (MINIMOS)  of  high  field  transport  in  semiconductor 
devices.  Good  agreement  between  both  methods  is  found  for  simulations  of  a  submicitHi 
MOSFET  within  the  regime  of  validity  of  MINIMOS.  while  for  very  short  gate  length  nonlocal 
transport  effects  significantly  influence  the  results  of  the  CA  simulation.  In  particular,  the 
occurence  of  velocity  overshoot  in  die  channel  of  the  device  is  responsible  for  the  enhanced 
drain  current  in  the  CA  simulations. 
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Abatnct 

A  new,  fast,  two-dimensional  model  is  presented  that  couples  the  classical  semiconductor 
transport  equatitms  with  quantum  mechanics.  It  is  particulariy  suited  to  the  simulation  of  single 
channel,  pseudomoiphic  and  multi-channel  HF^s  and  delta  doped  structures,  where  the 
primary  craiduction  path  between  source  and  drain  is  via  carriers  cmifined  to  a  two-dimensional 
potoitial  u«ll.  The  model  solves  Poisson's  and  the  current-continuity  equation  self  consistently 
with  the  effective  mass  Schrttdinger  equation,  the  latter  taken  in  slices  perpendicular  to  the 
heterojunctimi.  Current  flow  is  modelled  by  restricting  the  discretised  "quantum"  electrons  to 
two-dimensional  motirm,  neglecting  rKxi-equilibrium  dynamics.  The  model  improves  on 
previously  reported  versions  by  allowing  a  full  two-dimensional  treatment  of  the  Fermi-level, 
allowing  a  non-equilibrium  treatment  of  Schrodinger's  equatim. 


Introduction 

The  increasing  use  of  AlGaAs/lnGaAs/GaAs  heterostructure  devices  has  lead  to  great  interest 
in  modelling  the  electrons  omfined  within  2  Dimensional  Electron  Gas  (2DEG)  layers. 
Classical  schemes  are  based  upon  the  electron  waveftinction  be’mg  described  as  Bloch  states, 
which  in  turn  are  derived  from  flat  band  conditions.  When  events  occur  that  approach  the  size 
of  the  electron  wavepacket,  ie.  the  de-Broglie  waveloigth,  this  approximation  is  invalid  and 
quantum  effects  have  to  be  included.  Unfortunately  a  rigorous  solution  of  the  quantum 
mechanical  equations  becomes  very  involved  and  computationally  intensive,  limiting  freir  use 
within  physical  device  modelling.  For  this  reason  a  simplified  scheme  has  been  adopted 
whereby  quantum  effects  are  modelled  by  solving  the  effective  mass  Schrbdinger  equation  (1) 
across  the  heterojunction  but  assuming  the  classical  approximaticms  to  hold  elsewhere. 


2qdy[m*  ^  J 


(1) 


where 


Vj,  is  the  heterojunctitm  potential  and  V^^die  exchange-correlation  energy  [1].  This  approach 
is  justified  as  the  smallest  event  in  the  x-directkm  is  die  gate  which  is  at  least  an  order  of 
magnitude  larger  than  the  electron  wavelength.  The  electron  density  is  now  described  by 
equation  (3),  [2],  instead  of  the  classical  Fermi-integral. 


I 


"  *  E  1^*1^  ^  e*P  -  I 

The  model  therefore  solves  Poisson’s  equation  self-consistently  with  the  classical  charge 
tfansptMt  equation,  with  the  modification  that  the  electrons  are  divided  into  two  classes.  The 
first  posessing  "quantised  motion"  are  only  permitted  to  move  parallel  to  the  heterojunction, 
and  are  formed  ^m  the  electrons  whose  intersub-band  separation  is  greater  than  the  thermal 
enei;gy,  ksT.  The  second  are  normal  three-dimensimial  electnms,  which,  in  order  to  reduce  the 
computaticMial  demands  are  approximated  within  the  model  by  the  Fermi-integral  taken  fiom 
a  quasi-crmtinuous  conduction  band  edge  shown  in  Figure  1. 


Figure  1 .  Conduction  band  edge  of  a  pseudomorphic  AlGaAs/lnGaAs/GaAs  HFET  showing 
the  separation  of  the  two-  and  three-dimensional  electrons. 


Simulatioo  Details 

The  equations  are  discretised  over  a  non-uniform  rectangular  mesh  using  central  finite 
differences.  Schrodinger's  equation  is  solved,  first  by  finding  the  eigenvalues  via  a  bisection 
method  based  upon  a  "Sturm"  sequence  {3].  This  method  is  both  fast  and  robust,  with  the 
upper  and  lower  bounds  initially  set  using  Gershgorin's  theorem  |4],  and  then  updated  using 
the  last  calculated  eigenvalue.  The  eigenvalues  are  then  substituted  into  Schrbdinger's  equation 
and  eigenvectors  found  using  a  Newton  iterative  scheme.  The  current-continuity  equation  was 
formulated  using  current  densities  calculated  at  the  half-nodes,  assuming  the  independent 
variables  i|r  and  ^  vary  linearly  in  between  the  nodes.  This  has  the  advantage  that  no 
"Scharfetter-Gummel"  inteipolation  scheme  is  necessary,  substantially  simplifying  the  device 
equations. 

The  principle  device  equations  are  all  highly  non-linear  in  the  independent  variables  <{>,  i|r. 
Consequently  a  modified  Newton-Raphson  iterative  scheme  was  employed  which  requires  the 
Jacobian,  a  matrix  formed  fix>m  the  partial  derivatives  of  the  functions  with  respect  to  each  of 
the  variables.  All  of  the  terms  in  the  two  equations,  with  the  exception  of  the  partial  derivative 
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of  iijD  with  respect  to  are  readily  differentiable.  This  term  depends  explicitly  upon  the  partial 
derivatives  of  (k  *tid  which  are  calculated  using  pemirtMtion  theory,  equation  (4) 


at 


s  -1 


(4) 


using  this  result  it  is  evident  by  inspection  of  equatton  (3)  that 

^  ^2D 


(5) 


Results  I 

The  model  has  been  used  to  simulate  several  devices  including  single  channel,  pseudomorphic 
and  multichannel  FETs.  A  typical  pseudomorphic  device  structure  is  shown  in  Figure  2.  The 
ohmic  contacts  are  assumed  to  extend  to  the  2DEG  and  thus  the  modelled  region  is  truncated 
placing  the  source  and  drain  contacts  at  the  sides.  A  lumped  access  resistance  is  then  added 
explicitly.  * 


■once  gate  (  0  J  |( )  (hain 


Figure  2.  A  typical  pseudomorphic  HFET  structure  used  in  the  simulations. 


Hgures  3  and  4  show  Fermi-level  and  conduction  band  edge  profiles,  taken  at  s  i.7V  and 
Vqs®  OV.  The  Fermi-level  shows  significant  distortitm  around  the  gate,  clearly  perturbed  from 
equilibrium,  although  in  die  InGaAs  channel  and  GaAs  substrate  the  gradient  of  the  Fermi- 
level  (driving  force)  is  nearly  parallel  to  the  heterojunction.  The  cmducticm  band  edge  shows 
the  two-dimensional  potential  well  ftnrned  in  the  InGaAs. 


130 


4 


4 


Figure  4.  Conduction  band  edge  for  a  pseudomorphic  HFET. 


Figure  5  shows  the  Ids-^ds  characteristics  for  this  device,  comparing  quantum  (solid  lines)  with 
classical  Fermi-integral  solutions,  from  which  it  is  evident  that  the  quantum  simulation  has  a  ^ 

lower  output  current.  This  is  mainly  attributable  to  the  reduced  carrier  density  produced  by  this 
scheme,  as  current  flow  in  the  y-direction  is  mainly  limited  by  the  high  access  resistances  to 
the  2DEG  regions  which  are  populated  by  three-dimensional  electrons  in  both  cases. 


4 


4 


131 


V*  /V 


Figure  S.  lor^os  curves  for  the  quantum  and  classical  simulations. 


Conclusions 

A  two-dimensional  HFET  model  incorporating  quantum  mechanics  is  presented  that  solves 
Schrddinger's  equation  in  a  more  self-consistent  fashion  than  has  previously  been  reported.  The 
quantisation  in  electron  motion  is  explicitly  taken  into  account,  and  found  to  have  little  effect 
in  the  fmal  current-voltage  curves.  A  significant  reduction  in  current  is  observed  between 
quantum  and  classical  models,  principally  attributable  to  a  lower  free  electron  density. 
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Abstract 

In  recent  years,  the  effective  field  approach  has  been  developed  in  terms  of  hydrodynamic 
variables  [1,  2,  3].  In  this  paper,  the  effective  field  is  formulated  for  the  calculation  of 
transport  coefficients  in  a  drift-diffusion  model.  With  these  transport  parameters,  it  was 
found  that  calculations  of  the  energy  distribution  function  tul,  and  average  velocity  match 
suprisingly  well  with  Monte  Carlo  results,  for  a  variety  of  cases.  Therefore,  results  so  far 
encourage  further  investigations  into  the  effectiveness  of  characterizing  device  performance 
along  with  device  reliability  in  terms  of  the  effective  field  concept. 

I.  Introduction 

Recently,  it  has  been  shown  that  in  some  cases  the  effective  field  determined  from  the 
average  energy  is  sufficient  for  characterizing  the  shape  of  space  dependent  energy  distri¬ 
bution  function  tails  [4].  In  this  work,  the  effective  field  is  defined  as  the  table  of  electric 
field  values  that  are  generated  by  finding,  at  each  space  point,  the  best  match  between  a 
homogeneous  field  and  inhomogeneous  field  calculation  of  the  energy  distribution  function 
tail.  Then,  the  effective  field  concept  is  extended  to  include  computations  of  mobilities 
and  diffusion  coefficients,  with  justification  based  upon  the  Bogoliubov  ansatz  [5]  . 

n.  Modeling 

In  1-D,  the  following  analytical  formula  for  the  effective  field  has  been  obtained  from 
Boltzmann’s  equation. 


Here,  it  is  seen  that  the  derivative  of  the  electric  field  is  being  attenuated  by  an  exponential 
term,  thus  demonstrating  that  the  faster  the  field  vau'ies,  the  more  the  effective  field  will 
lag  behind  the  locally  applied  field.  This  behavior  is  in  qualitative  agreement  with  the 
observations  of  Ref.  [6].  The  quantity  A  is  referred  to  as  the  relaxation  length,  and  is 
shown  to  be  well  characterized  by  the  ratio  of  the  diffusion  coefficient  to  the  average 
velocity  u. 


(2) 
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This  it  ft  irftDtceodentft]  cqufttioo,  however  ^proximfttioos  cfto  be  introduced  by  consid- 
eriog  Fig.  1,  wiuch  it  ft  plot  of  the  rotio  of  diffuitoo  o)efficient  to  drift  velocity  for  a 
homogeoeout  ^plied  field.  This  plot  characterises  the  variation  of  A,  and  it  is  seen  that 
at  hi^  fields,  where  velodty  saturation  is  present,  A  varies  slowly. 


Fig.  1.  TMs  graph  deaonstrates  the  general  dependence  of  A  upon  the  size 
of  the  electric  field.  For  high  fields, A  beeones  the  length  scale  over  which 
the  average  envsrgy  relaxes. 

Once  the  effective  field  is  known,  the  drift  velocity  is  found  to  have  the  following  simple 
form, 


(3) 

which  is  a  formula  that  is  not  restricted  to  I'D  consido’ations.  Here,  vjk(£e//)  represents 
the  average  velocity  of  carriers  in  a  homogeneous  field  For  low  fields,  where  the  slope 
of  Vk{E,ff)  is  frirly  constant,  this  expression  reduces  to  the  conventional,  local  expression 
for  carrier  mobility.  In  the  limit  that  the  applied  field  no  longer  varies,  this  expression 
yields  the  correct  steady  state  velocity. 

Though  the  drift  velocity  demonstrates  the  presence  of  velocity  overshoot,  and  its  rela¬ 
tionship  to  a  varying  field,  there  are  diffusive  and  convective  contributions  to  the  average 
velocity  as  w^.  In  1-D,  the  following  drift-diffusion-convection  (DOC)  equation  first 
introduced  by  Sanchez  [6]  was  used  to  molve  these  components, 

■^{Dn)  +  nv,  +  n  <  uj}—  >«  j.  (4) 

The  presence  of  the  ^  term  is  due  to  the  effect  of  a  spatial  variation  in  the  collision 
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rate  caused  by  variations  in  the  donor  impurities.  Using  tbe  effective  field  concept,  the 
following  expressions  for  tbe  transport  parameters  were  obtained: 


*  -  f  dfvlT{x,t)f^{x,p) 

where  /*  is  the  distribution  function  obtained  from  tbe  effective  field.  Although  equation 
(4)  represents  an  initial  value  problem,  it  did  not  pose  any  major  problems  for  self- 
consistent  device  simulations  where  two  boundaries  are  present.  A  second  order  version 
of  this  equation  was  also  implemented. 

in.  Results 

Shown  in  Fig.  2  is  tbe  effective  field  profile  generated  from  a  Monte  Carlo  analysis  of 
the  channel  region  of  a  MOSFET  [3].  Also  plotted  is  the  effective  field  generated  from 
formula  (1),  exhibiting  a  remarkable  agreement.  Shown  in  Fig.  3  is  the  effective  field 
profile  determined  from  a  case  studied  in  Ref.  (7).  Also  plotted  is  the  expression  of  V|^(E) 
and  Vd  to  demonstrate  that  there  is  a  significant  difference  between  using  the  effective  field 
drift  term,  and  assuming  that  the  drift  velocity  depends  only  upon  tbe  locally  applied 
field. 


Fig.  2.  A  cowparisoB  of  affective  fields  from  ref.  [1]  and  forwula  (1).  A 
Monte  Carlo  slaulatlon  was  used  to  verify  the  value  of  the  effective  field 
for  Bost  of  the  points  In  ref.  (!]. 
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In  Fig.  4,  •  compariton  <rf  the  average  velocity  obtained  from  equation  (4)  and  a  Monte 
Carlo  calculation  is  performed.  For  additional  comparison,  the  first  order  (Pi)  Legendre 
polynomial  calculation  performed  in  Ref.  [7]  is  also  included.  As  can  be  swn,  good 
agreement  between  the  Monte  Carlo  calculation  and  the  effective  field  calculation  of  the 
average  velocity  is  obtained.  The  agreement  is  better  than  the  Pj  calculation  probably 
due  to  an  implicit  avoidance  of  truncation  error  in  the  effective  field  approach. 


!F 


Fig.  3.  The  drift  velocity (v^) ,  hoaogeneoua  field  veloclty(vh).  self- 
consistent  electric  field  (£) •  and  effective  fleld(£(//)are  plotted  for  a 
n'^’nn'^iode  slaulatlon.  Bias  *  5.0  Volts.  The  discrepancy  between  v^andv^ 
deaonstrate  the  laportance  of  the  non-local  dependence  of  the  woblllty. 


Pig.  4.  The  average  velocity  froa  a  drift-diffusion  (DD)  equation*  a  Monte 
Carlo  (MC)  slaulatlon,  and  a  let  order  Legendre  polynoalal  (PI)  calculation 
found  In  ref.  (4]  are  plotted  for  coaparlson.  Better  agreeaent  than  the  PI 
approach  ie  attributed  to  an  lapllclt  avoidance  of  truncation  error. 
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The  discrepancy  in  the  barrier  region  of  the  device  is  due  to  the  fact  that  the  diffusion 
coefficient  has  been  overestimated  by  the  effective  field  formulation  presented  so  far.  To 
account  for  temperature  reduction  due  to  the  bvrier,  the  effective  field  is  reformulated 
using  a  different  approximation  scheme.  This  approximation  scheme  insures  that  the 
correct  equilibrium  behavior  is  obtained,  and  predicts  temperature  lowering.  However,  a 
detailed  comparison  and  study  in  this  region  of  the  device  is  left  for  future  work. 

III.  Summary 

An  approach  for  device  modeling  using  the  drift-diffusion  model  has  been  developed  which 
does  not  use  any  empirically  determined  parameters  except  for  those  that  are  associ¬ 
ated  with  deformation  potentials  and  band  structures.  Considerations  so  far  have  been 
restricted  to  a  single-valley  model  of  silicon.  However,  the  generality  of  the  approach 
encour2iges  an  investigation  into  considerations  of  multiple  bands  and  anisotropic  effects. 
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ALGORITHMS  FOR  THE  SOLUTION  TO  THE  HELMHOLTZ  EQUATION  IN 
THE  NUMERICAL  SIMULATION  OF  SEMICONDUCTOR  LASERS. 
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Department  of  Electrical  and  Electronic  Engineering 
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Abstract 

In  the  numerical  simulation  of  semiconductor  lasers  a  fast  accurate  method  is  essential  for  the 
solution  to  the  Helmholtz  equation.  Various  algorithms  are  considered  for  this  purpose  in  terms  of 
speed,  efficiency  and  accuracy  and  a  new  algcnrithm  is  developed  based  on  the  effective  index  method 
with  a  novel  solution  to  the  algelxaic  eigenvalue  equation.  The  algtxithm  is  found  to  be  highly  efficient 
for  all  modes. 

L  Introduction 

As  laser  stmetures  increase  in  complexity  it  becomes  more  important  to  simulate  their  behaviour 
in  order  to  understand  phenomena  dependent  on  structure  and  hence  optimise  their  performance.  As 
manufacturing  techniques  improve  the  buried  heterostructure  (BH)  device  is  lil^ly  to  grow  in 
importance.  A  typical  BH  device  is  shown  schematically  in  figure  1  in  which  the  InGaAsP  active  region 
is  surrounded  by  the  wider  band  gap  InP  on  all  sides.  In  the  simulation  of  these  structures  the  wave 
equation  must  be  solved  to  obtain  the  optical  intensity  of  each  lasing  mode  and  the  aim  of  this  paper  is 
to  investigate  the  most  efficient  way  to  solve  this  equation. 


X 

Figure  1  Device  Structure 
n.  Modelling 

Assuming  TE  modes  dominate  then  the  wave  equation  may  be  given  in  scalar  form  as 

[^+^+«(«.y)k;-P’]E,=o  (1) 

where  e(x,y)  is  the  permittivity  (which  may  be  conq)lex  due  to  gain  or  loss),  E,  is  the  optical  fiekl,  k,  is 
the  fine  space  wave  number  and  ^  is  the  propagation  constant  This  equation  must  be  solved  in  two 
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dimensioiu  for  the  structure  in  figure  1.  Of  die  efficient  methods  that  exist  the  effective  index  method 
[1]  is  the  roost  conuiKmly  used,  however  as  the  active  area  decreases  in  size  and  the  solution 
aiqnoaches  cut-off  the  accuracy  deteriorates  [2].  The  weighted  index  method  [3]  is  an  alternative 
ai^iroach  which  attempts  to  increase  the  accuracy  of  the  solution  whilst  avoiding  the  complexity  of  a 
two-dimensional  ntethod 

The  Weighted  Index  Method 

The  weighted  index  method  is  an  im|xovement  on  the  effective  index  method  as  it  uses  a 
weighted  mean  cS  the  permittivites  in  the  y  dirKtion  for  each  node  in  the  x  axis  and  a  weighted  mean  of 
the  permittivities  in  the  x  direcdon  for  each  node  in  the  y  axis.  This  is  described  with  reference  to  figure 
2. 


y 


active 

region 
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Hgure2  The  Weighted  Index  Method 
The  best  trial  solution  is  found  for 

E  =  F(x)G(y)  (2) 

F  and  G  are  complex  functions  of  the  single  variables  x  and  y  respectively  and  satisfy  the  ordinary 


differential  equations 

(3) 

|£+K„G  =  ft^ 

(4) 

where  >nd  3y  are  the  prtpagation  constants  of  the  differential  equations.  Equation  3  represents  a 
cut  along  aa'  and  equation  4  represents  a  cut  along  bb’.  There  are  p  nodes  along  the  aa'  and  q  nodes 

along  bb'.  is  the  weighted  mean  of  e(x,y)k^  for  each  nodal  value  Xp.  The  mean  is  taken  along  the  y 
direction  using  weights  w^,  i.e. 


139 


(5) 


1 

LaasiinUar£uhionK(}  is  given  by 
Ko«k22w,e„  (6) 


The  wei^ts  are  defined  opdraally  as 


a) 

w,=^0’.dy 

(8) 

where  F  and  G  have  undergtme  die  normalisation 

JL/**i 

(9) 

£g*=i 

(10) 

The  best  value  of  ^  is  given  by  the  Rayleigh  quotient  [3] 

P*  *  Pi  +P5 


(11) 


p  1 

The  two  me-dimensional  wave  equations  (3,4)  are  coupled  via  the  weighting  factras  and  are 
solved  alteraately  until  the  value  for  ^  converges. 


Tlie  algebraic  eigenvalue  equation 

With  both  the  effective  and  weighted  index  methods  a  one  dimensional  algebraic  eigenvalue 
equation  must  be  solved. 


(12) 

where  A  is  complex  and  tndiagonal  and  the  q>tical  field  E  is  a  column  vector.  If  a  non-unifcmn  grid  is 
used  then  A  is  unsymmetrical  and  must  be  symmetrised  using  a  similari^  transfnnuttion.  The 
eigenvalue  and  eigenfunction  of  the  coefficient  matrix  A  give  the  propagation  constant  P  and  the 
cocreqponding  optkal  field  E  re^Kctively.  A  new  highly  efficient  method  is  described  next 

Evatuatkm  of  the  diaracteristic  pdynomial 

A  veiy  fest  method  to  obtain  any  required  eigenvalue  is  the  evaluation  of  the  characteristic 
polynomiaL  A  trial  value  of  die  eigenvalue  is  used  in  a  Sturm  sequence  which  can  then  be  used  to  locate 
the  required  eigenvalue  via  bisection.  For  any  trial  eigmvalue  X,  the  Sturm  sequence  is  defined  as 

P.(X)*1 

PiW  *(»•  -^)Pw(>-)-*5PmW 
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The  number  of  changes  in  sign  between  consecutive  numbers  (tf  the  sequence  is  equal  to  the 
number  of  eigenvalues  smaller  than  X.  The  advantage  of  this  method  is  its  flexilMlity  in  that  it  allows  all 
the  eigenvalues  to  be  found  in  a  given  range  however  the  disadvantage  is  it  cannot  c<^  with  complex 
matrices. 

An  alternative  method  is  inverse  power  iteration  with  successive  eigenvalue  refinement  It 
consists  of  iterating  around  the  equation 

skjXi  (14) 

where  kj  is  chosen  such  that  =1.  is  adjusted  on  each  iteration  via 

(15) 

K 

A  linear  system  of  equations  must  be  solved  at  each  step.  This  algorithm  is  fast  and  can  cope  with 
complex  eigenvalues  and  eigenvectors.  The  main  disadvantage  is  that  the  method  requires  good 
approximations  to  both  the  eigenvalue  and  the  eigenvector  of  the  problem  to  initialise.  However 
advantage  can  be  taken  of  the  fact  that  the  change  in  imaginary  con^nents  of  the  permittivity  is  much 
less  than  the  change  in  real  components.  This  is  due  to  the  BH  laser  being  index-guided  in  both  lateral 
and  transverse  directions.  To  obtain  a  good  initial  approximation  to  the  complex  problem  the  imaginary 
components  can  be  ignored  and  the  real  problem  can  be  solved  via  the  evaluation  of  the  characteristic 
polynomial  for  the  eigenvalue  and  an  inversion  performed  for  the  eigenvector.  The  results  can  then  be 
used  to  initialise  the  complex  inverse  iteration  with  successive  eigenvalue  refinement  algorithm  to  obtain 
the  complex  solution.  A  flowchart  of  the  algorithm  is  shown  in  fimim 


Hgure  3  Flow  Chart  of  Operation 
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m.  Results  and  Discussion  ,  ^ 

Results  sre  shown  in  figure  4  for  an  InGaAsP/InP  device  using  the  effective  index  Method  (EI)< 
the  Weiglned  Ktethod  (Wl)  and  a  two-dimensional  solution  (2D)[21.  The  fundamental  and  first 
order  mode  jiytirys  are  ralr»t«“vl  for  vaiying  channel  widths  (W).  In  this  example  wlwre  the  core  is 
0.35  tmerons  diick  the  effective  index  method  becomes  significantly  in  error  when  the  width  is  less  dian 
two  microns  i.e.  when  the  active  region  area  is  less  than  apjMOximately  0.6  ^im?- 

The  significance  of  the  above  results  will  now  be  considered  with  respect  to  the  design  of  BH 
lasers.  One  of  the  most  important  characteristics  of  these  devices  is  the  power  at  u*ich  the  first  1^  m 
the  light-current  characteristics  occurs.  This  kink  is  due  to  the  first  order  nxide  achieving  gain  and 
commencing  la^ne  The  above  results  indicate  that  for  accurate  determination  of  this  lasing  power  the 
El  method  is  insufficient  and  the  WI  method  is  preferred.  An  interesting  impUcation  of  the  results  given 
in  figure  4  is  that  the  El  method  will  always  underestimate  the  optical  power  at  which  the  first  order 
mode  starts  to  lase  and  the  weighted  index  method  wiU  always  overestimate  this  power.  This  is  a  di^t 
result  of  the  effective  index  method  overestimating  the  mode  index  and  hence  overestimating  the  optical 
gain.  Equivalently,  the  weighted  index  underesdmates  the  mode  index  and  thereby  underestimates  the 
optical  gain.  This  has  important  impUcations  for  device  engineers.  If  a  laser  is  designed  to  have  a 
minimum  output  power  at  which  the  first  order  mode  starts  to  lase,  then  using  both  the  WI  and  El 
methods,  they  can  be  reasonably  sure  that  the  experimental  power  will  be  bounded  by  the  results 
calculated  from  the  two  methods. 


Figure  4  Typical  Result 


(i)  FundamentaJ  mode 

(ii)  Fini-order  mode 
(INSET :  Device  structure) 
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Abetrnet 

A^taaliMtion  of  femicondcutor  bandstmctoret  ia  accompHiked  in  the  3-D  Btilloain  sone 
by  wing  cdor  rendering  techniques  to  generate  energy  isosnrfaoes  and  color  contour  plots 
on  crossectkmal  planes.  Combination  of  the  varioas  techniques  offers  an  efficient  way  to 
analyse  the  data  an  get  physical  insight  relevant  for  carrier  transport.  The  visualisation  is 
performed  on  a  Digital  5000  workstation  using  the  Application  \^sua]isation  System  (AVS) 
griq>lucs  software. 


I.  Introduction 

Advanced  investigations  of  transport  phenomena  in  semicondnctois  require  a  detailed 
knowledge  of  the  energy  bandstructnre.  This  is  particnlady  true  when  high  energy  effects 
are  investigated,  since  available  analytical  fcnmulations  of  the  bandstmctuxe  are  inaccurate 
at  incomplete.  For  instance,  some  Monte  Carlo  applications  suitable  for  the  simulation  of 
impact  ionisation  and  of  other  effects  rdevant  for  device  rdiabOity,  implement  algorithms 
with  a  full  bandstructure  for  the  evaluation  of  momentum  space  trajectories.  We  found 
that  3-D  visualisation  of  the  bandstructure  in  momentum  space  is  an  important  step  for  the 
understanding  of  the  transport  physics  and  for  the  devdopment  of  optimisation  techniques 
for  this  type  of  Monte  Carlo  simulations.  Since  scattering  rates  are  also  closdy  related 
to  the  bandstructure,  the  same  visualisation  techniques  can  be  ^>plied  to  analysis  of  the 
momentum  dependence  of  scatterings. 

We  have  experimented  with  a  number  of  available  visualisation  tools  and  we  have  found 
the  Application  Visualisation  System  (AVS)  software  to  be  very  suitable  for  these  3-D 
visuaHsation  problems.  In  the  AVS  environment,  most  of  the  applications  are  built  by  an 
interactiTe  symbdic  procedure,  assembling  an  ^plication  network  with  modules  available 
in  a  menu.  Custom  iq»plications  can  be  devdoped  by  writing  additimuJ  modules  using  C 
language  programming.  An  example  AVS  iq>plication  network  we  have  used  to  visualise 
the  bandstructure  of  nUoon  is  shown  in  1. 


n.  ’^^snalisntion  Approach 

The  iqiproach  which  we  found  most  effective  to  visualue  energy  bandstrutnres  is  an  ani- 
matkm  of  energy  isosnrfMes.  We  have  bnSt  a  Bxilhioia  sooe  and  we  display  the  isosurfoces 
in  the  cube  which  contains  it.  The  sHght  redundancy  of  information  which  falls  outside  the 
sane  hdps  in  getting  a  visual  appreciation  of  the  connection  between  adjacent  BtiUonin 
sones.  ffinoe  an  isosuxfaoe  <mfy  displays  information  for  <me  energy  value  at  a  time,  it  is 
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often  neoenerj  to  combine  tins  techniqne  with  others  to  get  s  more  complete  idea  of  the 
OTeraU  behasim.  Sereral  iaosnrftces  can  be  displayed  at  one  time  with  the  use  of  trans¬ 
parency  and  color  coding,  athoogh  this  may  lead  to  extremely  complicated  images.  This 
type  of  eisnaBaatwim  is  better  suited  to  TisnaKtation  of  only  the  irredndble  wedge  of  the 
BriUonin  lone.  On  a  fall  scale,  we  found  very  useful  to  combine  energy  isosurfaces  with 
cross-sectional  planes  where  a  colm  contour  plot  of  energy  is  represented.  The  isosurfaces 
are  generated  only  on  one  side  of  the  plane,  which  can  be  made  to  slide  back  and  forth, 
and  the  intersections  between  the  surfaces  and  the  plane  offer  uery  helpful  Tisuol  cues.  The 
c^Mihility  to  perfimn  rotations  of  the  sofid  figure  allows  then  the  xiewer  to  pinpoint  any 
specific  region  of  interest.  An  example  of  AVS  applieation  network  we  have  used  to  visualise 
the  bandstructure  of  silicon  is  shown  in  Fig.  1.  The  network  represents  actual  subroutines 
which  are  assembled  in  a  symbdic  way.  The  bandstructure  data  is  input  through  the 
branch  beginning  with  the  block  read  field  while  the  BriUonin  sone  is  constructed  by  the 
adjacent  branch  beginning  with  read  geom.  The  image  is  displayed  after  the  program  runs 
through  aU  the  blocka,  and  the  user  can  then  interactively  modify  the  image  attributes, 
like  orientation,  sise,  cdor  palette,  lighting,  to  a4just  the  image,  or  can  run  again  the 
program  by  simply  modifying  visualisation  parameters,  like  the  value  of  energy  isosuxface, 
the  position  of  the  cross-sectional  plane,  or  the  data  displayed  in  the  pven  geometry  (e.g. 
switch  from  bandstructure  to  scattering  rates).  The  program  can  also  be  changed  inter¬ 
actively,  by  breaking  the  connections  in  the  network,  and  adding  or  subtracting  modules. 
A  large  amount  of  information  can  be  visualised  at  one  time,  since  the  isosurfaces  can  be 
color  coded  with  another  parameter  (e.g.  scattering  rate)  so  that  by  scanning  isosurfaces 
at  different  enerpes  a  5-D  space  is  represented. 

We  present  here  several  examples  of  3-D  visualisation  for  the  silicon  energy  bandstruc¬ 
ture.  Figure  2a  shows  a  view  of  a  cross-sectional  100  plane  for  the  first  branch  of  the 
conduction  band  in  the  full  BiUluoin  sone.  For  the  take  of  reproduction,  we  use  here  a 
gray-scale  palette,  ranfpng  from  black  (lowest  energy)  to  light  gray  (highest  energy).  For 
color  representation,  we  normaUy  use  the  conventional  rainbow  fuJoTiiig  scheme,  ranging 
from  blue  to  red.  It  is  easy  to  detect  the  four  X-vaUeys  situated  on  this  plane  in  corre¬ 
spondence  of  the  darkest  rqpons.  The  same  image  for  the  second  branch  of  the  conduction 
band  is  shown  in  Fig.  4a.  In  both  images,  the  energy  isosnr&ces  are  behind  the  cross- 
section.  On  the  computer,  the  position  of  the  plane  can  be  moved  interactively  in  real  time 
by  translation  or  rotation.  Figures  2b,  3a,b  and  show  the  full  isosurfoce  configuration,  at 
various  energies,  for  he  first  conduction  band  branch,  and  4b  for  the  second  branch. 
Figure  5  and  6  show  results  for  the  conduction  band  of  gallium  arsenide.  The  silicon-like 
X-vaUey  isosurfaces  can  be  easily  identified  by  comparison  with  the  previous  images. 

The  data  for  the  bandstructure  were  obtained  by  using  an  eminrical  psendopotential 
approach.  All  the  images  are  based  on  a  cubic  uniform  medi  with  41  nodes  on  each  side. 
The  visualisation  was  performed  with  AVS  software  on  a  Digital  5000  workstation  and  the 
images  have  produced  on  a  Tektronix  dye  sublimation  printer. 
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Figure  1.  Simplified  AVS  network  used  for  the  vuualisntion. 


figure  2.  (a)  Gny-icale  oontou  plot  of  the  first  conduction  band  for  silicon  on  the  {100} 
plane;  (b)  Energy  isosnxfaoes  at  £  =  0.2  eV. 
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Abstract 

In  a  previous  paper,  it  has  been  suggested  to  employ  the  theory  of  stochastic  differen¬ 
tial  equations  (SDE)  for  the  modeling  of  electron  transp>ort  in  semiconductors.  It  was 
shown  that  the  differential  equations  which  describe  electron  transport  in  semiconduc¬ 
tors,  and  which  are  used  in  Monte  Carlo  simulations,  can  be  interpreted  as  stochastic 
differential  equations  driven  by  inhomogeneous  randomly  weighted  Poisson  processes. 
Based  on  this  connection,  the  theory  of  SDE  was  employed  to  demonstrate  that  the  for¬ 
ward  Kolmogorov- Feller  equation  which  characterizes  the  tr2msition  probability  density 
function  of  this  random  process  can  be  integrated  over  the  initial  conditions  to  obtain 
the  linear  Boltzmann  transport  equation. 

In  the  paper,  we  will  expand  on  our  previous  results  and  present  a  transport  model 
which  leads  to  the  efficient  numerical  computation  of  the  electron  distribution  function. 
The  model  is  based  on  an  approximation  of  the  collision  integral  in  the  Boltzmemn 
transport  equation  by  differential  operators.  As  a  result  of  this  approximation,  the 
electron  distribution  is  characterized  by  a  second  order  partial  differential  equation  in 
momentum  space.  It  turns  out  that  the  coefficients  of  this  partial  differential  equation 
are  the  first  and  second  order  moments  of  the  scattering  transition  rate.  We  will 
present  the  derivation  of  these  terms  and  also  present  some  preliminary  results  on  the 
computation  of  the  electron  distribution  function  based  on  this  model.  Furthermore, 
by  comparing  these  resiilts  with  Monte  Carlo  simulations,  we  will  explore  the  limits  of 
applicability  of  this  model. 


I.  Background: 

Previously,  it  has  been  suggested  that  the  theory  of  stochastic  differential  equations 
can  be  employed  to  model  current  transport  in  semiconductors,  (see  Ref.  [1]).  In  this 
section,  we  will  review  some  of  the  previous  results  and  establish  a  connection  between 


semi-classical  transport  theory  and  the  theory  of  stochastic  differential  equations.  Ac¬ 
cording  to  semi-classical  transport  theory,  an  electron  in  a  semiconductor  drifts  under 
the  influence  of  a  macroscopic  electric  field  and  experiences  occasional  random  jumps 
in  its  momentum  due  to  different  scattering  mechanisms  in  the  crystal,  such  as,  acous¬ 
tic  and  optical  phonons,  ionized  impurities,  etc.  This  motion  of  an  electron  can  be 
described  by  the  following  stochastic  differential  equation: 

f  =  (I) 

^dk  ^ 

+  (2) 

Fr  =  (3) 


Here,  x,  v  and  k  are  the  electron  position,  drift  velocity  and  wave  vector,  respectively. 
E  is  the  electric  field,  e{k)  is  the  energy- wave  vector  relationship  and  Fr  is  the  random 
impulse  force  on  the  electron  due  to  scattering.  This  random  force  can  be  characterized 
by  the  scattering  rate  X{k)  and  the  transition  rate  S(k,  k  ).  The  probability  of  scattering 
in  time  is  given  by  the  following  expression: 


Pr{ti  -ti-i  >  t}  =exp{-  /  \{k{t))dt}  (4) 

Jti-i 

Given  that  a  scattering  event  has  occurred  at  some  time  ti,  the  probability  density 
fimction  for  the  change  in  the  electron  wave-vector  is  expressed  as  follows: 


PkS^i) 

where  k{t~)  =  ki  and  k{tf)  =  ki+  Uj. 


S{ki,ki  +  ui) 
A(fci) 


(5) 


Equations  (l)-(5)  define  a  Markov  process  which  is  discontinuous  in  wave-vector  space 
(compotmd  Poisson  process).  In  the  theory  of  stochastic  differential  equations,  such 
a  Markov  process  is  generally  characterized  by  a  transition  density  function,  which 
satisfies  the  Kolmogorov-Feller  forward  equation,  (see  Refs.  [2]  and  [3]  for  further 
details).  By  formally  integrating  this  equation  over  the  probability  density  function  of 
the  initial  state,  one  obtains  the  linear  Boltzmann  transport  equation: 


df  -  _  .  gS(x,t)  f  -w  _ 

+  =  j  f{x,k  ,t)S{k  ,k)dk  -\{k)f  (6) 

where  /  is  the  electron  distribution  function. 

This  establishes  a  direct  connection  between  semi-classical  transport  theory  and  the 
theory  of  stochastic  differential  equations.  It  also  makes  it  possible  to  apply  the  ma¬ 
chinery  of  stochastic  differential  equations  to  semiconductor  device  modeling. 


II.  The  TVansport  Model: 

In  this  section,  we  will  present  a  transport  model  which  is  based  on  the  Boltzmann 
transport  equation.  According  to  semi-classical  transport  theory  (2),  the  total  force 
on  the  electron  is  due  to  the  electric  field  E  and  the  random  scattering  force  Fr.  The 
expected  value  of  this  random  force  can  be  computed  from  (3)-(5)  and  is  given  by  the 
following  expression: 

uS{k,k  +  u)du  (7) 

This  expression  is  in  fact  the  first  moment  of  the  transition  rate  5.  For  acoustic  and 
optical  phonon  scattering,  it  can  be  shown  that  the  expected  value  of  the  random  force 
is  given  by  the  following  expression: 

P^{k)  =  -hXik)k  (8) 

which  can  be  interpreted  as  a  drag-force  opposite  in  direction  to  the  electron  wave- 
vector.  Substituting  this  expression  into  (2),  one  obtmns  the  following  set  of  stochastic 
differential  equations: 

dib  r  -  -  -1  - 

h^=  -qE{x,t)-h\ik)k  +F^ 
dt  L 

Here,  denotes  the  zero  mean  component  of  the  random  (impulse)  force. 

In  order  to  obtain  a  transport  model  suitable  for  efiicient  numerical  implementation, 
the  integral  operator  in  the  Boltzmann  transport  equation  can  be  approximated  by  a 
second  order  differential  operator.  This  turns  out  to  be  equivalent  to  approximating  the 
zero  mean  fluctuating  force  F^  by  the  derivative  of  a  zero  mean  Wiener  process  (white 
noise).  The  variance  of  the  Wiener  process  is  taken  to  be  equal  to  the  momentum 
dependent  variance  of  the  random  force: 

=  J  UiUjS{k,k  +  u)du  (11) 

This  is  in  fact  the  second  order  moment  of  the  transition  rate  5.  The  second  order 
moments  of^the  transition  rate  can  be  computed  by  integrating  over  equal  energy 
surfaces  in  k-space  (spheroids  or  ellipsoids).  For  instance,  in  the  case  of  a  spherical 
band  structure,  the  second  moment  in  the  direction  orthogonal  to  k  is  given  by  the 
following  expression: 

=  (12) 

m 

and,  in  the  direction  parallel  to  le,  it  is  given  by 

«T*,*,(fc)  =  <r*,*Jfc)-»-|k|*A(k)  (13) 


(9) 

(10) 


hik)  =  £;{  A}  =hj 
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Here,  m  is  the  index  over  different  scattering  processes,  0  is  the  dispersion  factor  (see 
Ref.  [4])  and  is  the  radius  of  an  equal  energy  sphere  in  ib-space  after  scattering. 
Similar  relations  hold  in  the  case  of  ellipsoidal  bands  by  the  use  of  the  Herring- Vogt 
transformation  (see  Ref.  [5]).  Figs.  1  and  2  show  the  expected  value  of  the  random 
force  Fr{k)  and  the  variances  respectively,  as  a  function  of  the 

magnitude  of  the  electron  wave  vector. 

Employing  the  approximation  for  the  zero  mean  random  force  results  in  a  second  order 
partial  differential  equation  for  the  electron  distribution  function.  In  spherical  coordi¬ 
nates,  this  equation  has  the  following  form: 


%  +  i  Vje(*) .  V,/  -  = 

1/  1  52  . .  1  52  \ 

2  V drdr  r2  3686 ^  sin^  6  d<j>d4> ) 


(14) 


In  the  talk,  we  will  present  preliminary  results  on  the  computation  of  the  electron 
distribution  function  employing  the  above  model. 
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Figure  1.  The  expected  value  of  the  random  force  vs.  the  magnitude  of  the  electron 
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Figure  2.  The  variances  ffkxhxW  respectively,  as  a  function  of  the 

magnitude  of  the  dectron  wave  vector. 
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Abstract 

A  parallel  Monte  Carlo  device  simulator,  PMC-3D  has  been  developed  for  multiproces¬ 
sors.  Through  the  use  of  parallel  architectures,  full  three  dimensional  modeling  of  the 
device  domain  is  possible.  Here  a  discussion  of  the  parallel  algorithm  is  given  for  coupling 
the  Monte  Carlo  particle  simulation  with  Poisson’s  equation  for  quasi-static  problems, 
and  full  Maxwell’s  equations  for  electro-optic  devices. 


I.  Introduction 

PMC-3D  is  a  parallel  three  dimensional  (3-D)  Monte  Carlo  device  simulator  written  for 
multiprocessors[l][2].  The  parallel  algorithm  is  an  extension  of  the  standard  Monte  Carlo 
device  simulation  model  in  3D  in  which  the  particle  dynamics  generated  from  the  stochas¬ 
tic  Monte  Carlo  method  are  solved  simultaneously  with  the  appropriate  set  of  field  equa¬ 
tions  on  a  3D  mesh  using  finite  differences.  For  quasi-static  problems  such  as  the  behavior 
of  a  three  terminal  MESFET  device,  Poisson’s  equation  is  solved  for  the  potential  and 
corresponding  electric  field  used  to  drive  the  particle  dynamics.  Both  the  Poisson  solver 
and  the  Monte  Carlo  phase  are  parallelized  due  to  the  large  computational  requirements 
of  solving  Poisson’s  on  a  3D  grid.  The  implementation  of  the  Poisson  solver  is  based  on 
iterative  method  that  uses  an  odd/even  ordoing  with  Chebyshev  acceleration.  The 
code  was  developed  both  for  a  distributed  memory  1024  node  nCUBE  multicomputer  and 
a  4-node  shared  memory  Ardent  Titan  multiprocessor. 


n.  Parallel  Algorithm 

In  the  distributed  memory  implementation,  the  spatial  domain  of  the  device  is  subdivided 
onto  separate  processors  according  to  a  recursive  bisection  algorithm.  Figure  1  shows  the 
decomposition  of  the  ME)SFET  problem  for  a  simple  four  processor  case.  The  particles 
and  mesh  associated  with  each  of  the  subrepons  is  mapped  onto  separate  processors. 
The  choice  of  spatial  domain  size  is  chosen  to  roughly  balance  the  number  of  particles  per 
processor,  and  adjusts  itself  throughout  the  simulatioa  to  maintw  a  balanced  processor 
load.  The  code  is  written  in  C,  and  contains  compiler  directives  to  distribute  portions  of 
the  code  on  various  processors. 

The  parallel  to  sequential  speedup  was  characterized  in  several  ways  shown  in  Fig.  2. 
The  speedups  for  the  fixed  size  and  scaled  size  problem  were  measured  for  both  the 
Poisson  phase  and  the  Monte  Carlo  phase  of  the  code  on  a  1024  nCUBE  hypercube 
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Figure  1:  Mapping  onto  a  4  processor  hypercube  and  external  interaction  region 

with  dynamical  load  balancing  turned  off.  For  the  fully  scaled  problem,  an  efficiency  of 
69.8%  was  obtained  for  512  processors  compared  to  the  ideal  (Amdahl’s  law)  speedup. 
An  additional  increase  of  8%  was  obtained  using  dynamical  load  balancing.  In  the  fully 
scaled  case,  several  million  grid  points,  and  several  million  particles  were  simulated. 

III.  Electromagnetic  Modeling 

For  high  frequency  operation  or  for  electro-optic  devices,  Poisson’s  equation  is  insufficient 
to  correctly  characterize  the  dynamical  behavior  of  the  system,  and  thus  PMC-3D  has 
been  extended  to  include  full  electromagnetic  solutions  of  Maxwell’s  equations  coupled 
with  the  particle  transport  models.  The  Lorentz  force  equation  which  determines  the 
carrier  momentum  between  collisions,  includes  the  effect  of  the  magnetic  and  electric 
fields  obtained  from  the  time-dependent  soluti<»i  of  Maxwell’s  equations,  and  is  given  by 

F  =  hk  =  q{E-¥vxB) 

The  standard  finite  difference/time  domain  (FD/TD)  formulation  of  the  field-equations 
is  used  to  compute  the  time-evolving  quantities  EyH  and  J  with  p(z,y,r,t)  being  the 
source  term  corresponding  to  the  charges  present  as  described  by  Connolly  et  al.  [3]. 
The  algorithm  involves  two  grids  in  space  (for  E  and  H  fields)  displaced  by  half  mesh 
increments  in  the  three  spatial  directions.  Every  component  of  E  (or  H)  is  computed 
at  a  given  time,  using  the  four  adjacent  values  ci  H  (or  E)  that  contribute  to  the  loop 
integral,  that  results  from  the  Maxwell’s  curl  equations.  In  addition,  the  E  and  H  fields 
are  computed  at  time  instants  differing  by  a  half  time  step,  amounting  to  a  leapfrog 
method  of  cmnputing  the  time  evolutkm  of  the  field  quantities. 


Figure  2:  Parallel  to  sequential  speedups  of  PMC-3D 


The  current  density  term  J  is  central  to  the  feedback  that  exists  between  the  Monte 
Carlo  part  and  the  field  computations.  It  is  computed  once  every  timestep  at  every  grid 
point  by  sununing  the  weighted  velocities  of  the  particles  in  the  ensemble  that  lie  within 
a  unit  cell  volume  about  the  grid  point,  and  is  given  as 


9 

AxAy^z 
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where  Sn  &nd  v„  refer  respectively  to  the  charge  and  velocity  of  the  nth  particle  associated 
with  the  grid  pmnt  [3]. 

The  electric  fields  ate  initialized  to  the  values  obtained  using  a  one-time  solution  of 
Poisson’s  solution,  corresponding  to  the  applied  DC  bias  voltages  on  the  strip  line,  while 
the  magnetic  fields  are  assumed  to  be  zero.  Dirichlet  boundary  conditions  are  imposed 
on  the  metal-semiconductor  boundaries  as  well  as  on  all  the  conducting  walls.  Some 
preliminary  results  have  also  been  obtained  by  impoeing  Netunann  boundary  conditions 
at  the  semiconductor-air  interfaces. 

The  FD/TD  algorithm  essentially  requires  the  solution  of  an  identical  set  of  difference 
equations  at  every  grid  p<wt  and  hence  is  ideally  suited  for  parallel  implementation.  In 
addition  to  updating  the  electric  and  "  ,'^ietic  fields  ci  the  grid  points  belonging  to  the 
subgrid  m^ped  to  its  local  memory,  every  processor  also  needs  to  communicate  with  its 
nd^boring  processors  via  message  passing  to  obtain  the  electric  or  magnetic  field  values 
that  may  be  required  in  the  onnputatkHi  of  fidds  along  the  boundaries  of  subgrids. 

The  program  flow  during  a  typical  timestep  in  each  processor  starts  off  with  the 
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computation  of  electric  fields  «t  time  t.  (after  computing  the  current  density  vector  J 
at  each  grid  pmnt  at  time  t»),  followed  by  evolution  erf  the  particles  in  momentum  and 
real  space  for  half  a  timestep.  After  computing  the  magnetic  fields  at  time  tn  +  ^  using 
the  electric  field  values  at  the  ensemble  is  allowed  to  evolve  for  another  half  timestep, 
thereby  completing  one  timestep  of  Monte  Carlo  particle  evolution. 

One  such  structure  that  was  modeled  is  the  photoconductive  switch  [4]  [5]  shown  in 
Fig.  3.  As  shown  there,  a  femtosecond  laser  pulse  is  used  to  generate  electron  hole  pairs 


Figure  3:  Elxperimental  configuration  of  photoconductive  switching 

between  two  DC  biased  coplanar  striplines  on  a  GaAs  bulk  or  superlattice  substrate.  The 
transient  dynamical  response  of  the  electrons  and  holes  as  they  accelerate  in  opposite 
directions  induces  a  subpicosecond  time  scale  transient  electrical  pulse  in  the  coplanar 
waveguide  which  propagates  down  the  stripline  and  is  subsequently  detected  at  a  different 
point.  Through  tMs  technique,  ultra-short  time  scale  electrical  pulses  may  be  generated 
and  detected  optically.  The  gap  separating  the  microstrip  lines  is  10  ftra,  while  the  strips 
are  2/im  wide.  The  GaAs  semiconducting  layer  is  1.2  pm  thick,  and  15pm  long  along  the 
z-direction.  A  20  femtosecond  laser  pulse  with  an  energy  of  1.55  eV  is  assumed  with  a 
spatial  width  of  2pm. 

Figure  4  shows  the  simulated  particle  current  for  a  fairly  low  (peak)  injection  density  of 
1 X  10^‘  /cm^.  The  results  obtain^  from  using  FD/TD  solutions  of  Maxwell’s  equations  as 
well  as  the  case  for  which  only  the  Poisson’s  equation  was  solved  for  updating  the  fields, 
are  shown  for  comparison.  It  is  seen  that  there  is  a  reasonable  qualitative  agreement 
at  low  density  between  the  quasi-static  solution  and  the  more  complete  model  where 
velocity  overshoot  is  directly  observed.  As  the  denrity  is  increased,  the  perturbation  of 
the  DC  electric  fields  due  to  the  higher  density  of  the  moving  charges  results  in  significant 
modification  of  the  time  dependent  magnetic  and  electric  fields.  At  very  high  densities, 
velocity  overshoot  is  barely  observed,  with  additional  effects  due  to  reflections  of  the 
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Figure  4:  Simulated  particle  current  for  Ni  =  lxlO*’/cm^ 


electromagnetic  fields  from  the  boundary  walls  coming  into  play.  Comparison  of  the 
effects  of  absorbing  versus  reflecting  boundary  conditions  are  currently  being  studied. 
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Abstract 

In  this  paper  two  unified,  scalable  and  transportable  parallel  approaches  to  the 
numerical  simulation  of  semiconductor  devices  are  presented:  Concurrent  Device 
Simulation  (CDS),  and  Spatial  Device  Decomposition  (SDD).  Both  approaches  have 
been  developed  and  tested  on  the  Parsytec  Supercluster  Model  64  which  is  a  medium 
size  transputer  system.  A  series  of  examples  illustrate  the  application  of  the  developed 
parallel  simulation  tools. 


I.  Introduction 

The  rapid  progress  in  device  technology,  which  made  possible  the  fabrication  of  nanometer 
scale  band  engineered  structures  [1],  is  now  having  a  considerable  impact  on  device  simulation. 
New  physical  phenomena  such  as  carrier  heating,  ballistic  transport  and  quantisation  govern  the 
behaviour  of  nano-scale  devices  [2]  and  require  more  complex  and  expensive  models  for 
simulation.  For  nano-structure  devices,  full  3D  simulation  has  becoming  a  necessity.  The  need 
for  super  computer  power  often  restricts  both  the  complexity  of  physical  models  involved  in 
device  simulation  programs  and  their  widespread  application.  In  the  foreseeable  future,  a 
significant  low-cost  improvement  in  computing  performance  will  only  be  available  through 
Multiple  Instruction  Multiple  Data  (MIMD)  systems  where  necessary  speed-up  derives  from  dte 
use  of  parallel  processors  sharing  a  large  distributed  memory.  Hence  the  parallel 
implementation  of  device  simulation  codes  has  become  essential  [3].  Such  fast  parallel  codes 
co^d  be  used  for  real  device  optimisation,  sensitivity  analysis  and  yield  prediction  in  both 
research  and  industry.  However,  to  be  of  practical  use,  parallel  device  simulation  programs 
must  be  unified,  scalable  and  portable. 

We  report  on  two  parallel  approaches  to  numerical  simulation  of  semiconductor  devices, 
developed  in  the  Nanoelectronic  Research  Centre  at  Glasgow  University:  Concurrent  Device 
Simulation  (CDS),  and  Spatial  Device  Decomposition  (SDD).  Both  approaches  are 
implemented  on  the  departmental  transputer  system  -  the  Parsytec  Supercluster  Model  64  - 
which  is  a  medium  class  MIMD  system.  It  consist  of  64  electronically  reconflgurable  T800 
transputers  with  4MB  of  local  memory  per  processor. 


IL  Concurrent  Device  Sinnilation 

Any  pr^tical  simulation  run  can  cover  a  large  matrix  of  input  data  parameters  from  bias  points 
to  details  of  (tevice  design  such  as  gate  length,  vertical  layer  structure,  doping  distribution, 
recess  shape  etc.  A  simple  but  effective  form  of  parallel  tfevice  simulation  is  lb  run  several 
copies  of  die  serial  simulation  code  concurrendy,  implementing  Single  Program  Multiple  Data 
(SPMD)  computational  model  [4].  To  this  end  a  fileserver  has  been  design^  to  pass  input  data 
sets  to  a  pipeline  of  processors  running  our  serial  H2F  simulator  [5]. 

The  fileserver  is  split  into  two  parts:  the  server  which  is  system  dependent  and  the  harness 
which  is  application  dependent.  The  server  runs  on  a  simple  pipeline  of  processors  (fig.  1). 
First  it  runs  a  pre-processing  routine.  Then  it  collects  input  data  packets  from  a  send-data 
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routiiie,  |MSses  them  to  a  free  processor  on  which  a  process-data  routine  (the  simulator  H2F  in 
our  case)  runs,  collects  data  from  the  process-data  routine  and  returns  them  to  the  master 
processor  for  a  receive-data  routine.  This  cycle  is  repeated  until  all  the  data  packets  have  been 
processed  and  then  Utc  server  runs  a  post-processing  routine. 


Fig.  1  Pipeline  fileserver  calculating  in  parallel  Fig-  2  Time  tN  required  to  pass  set  of  input 
an  I*V  characteristic  data  to  the  last  processor  of  an  N- 

processors  pipeline 

The  server  is  implemented  using  a  simple  set  of  programs  and  can  easily  oe  adapted  for  a 
number  of  platforms  e.g.  a  fast  serial  processor;  Inmos  transputer  boards  running  3L 
languages;  a  Parsytec  Multicluster  32/Supercluster  64  running  Helios/Parix  or  a  Meiko 
computing  surface  of  transputers  /C40s  running  CDL.  The  server  links  to  a  harness  of  five 
routines  -  pre-process,  process,  post-process,  send-data  and  receive-data  which  are 
independent  of  the  system  and  completely  portable.  For  long  computational  jobs  the  fileserver 
has  proved  superior  to  conventional  farm  processing  techniques  supported  by  many  of  the 
existing  parallel  languages  because  it  can  be  reliably  scaled  and  is  not  restricted  by  the  platform 
dependant  variations  in  buffering  systems. 

The  communication  time  overhead  associated  with  the  fileserver  is  illustrated  in  Fig.  2  where 
the  measured  access  time  to  the  last  processor  is  plotted  as  a  function  of  the  fileserver  length. 
Since  a  single  bias  point  calculation  with  H2F  takes  approximately  one  hour  or  more,  the 
communication  ovei^ead  is  negligible.  However  the  average  access  time  can  be  significantly 
reduced  on  many  of  the  configurable  transputer  systems  if  a  ternary  tree  is  used  instead  of  a 
pipeline.  The  effectiveness  of  the  CDS  when  single  bias  data  are  calculated  on  each  processor  is 
also  restricted  by  two  other  factors.  First,  brcause  the  simulation  time  for  different  bias 
conditions  can  vary  significantly,  the  total  execution  time  is  determined  by  the  processor  with 
the  worst  combination  of  bias  points.  Secondly,  the  techniques  of  extrapolating  the  initial  guess 
from  previous  solutions  is  inapplicable.  The  second  disadvantage  may  be  reduced  by 
calculating  sets  of  bias  points  on  a  each  processor. 

Investigations  of  the  influence  of  the  structure  parameters  on  the  device's  performance 
has  proved  to  be  extremely  amenable  to  CDS.  Fig.  3  shows  a  set  of  I-V  characteristics  for  a 

100  nm  gate-lmigth  5-doped  pseudomorphic  HEMT,  calculated  in  parallel  with  a  CDS  on  60 


processors,  with  variable  5-doping  and  gate  length.  The  estimated  speed-up  for  this  particular 
simulation  is  around  30  which  reflects  approximately  50%  efficiency. 


Cbannel  length,  nm  300  Iqq 


Vd  [v]  Vd  [V] 


Fig.  3  Set  of  I-V  characteristics  for  4  different  PHEMTs  calculated  simultaneously 
using  the  CDS  approach 


III.  Spatial  Device  Decomposition. 

The  SDD  approach  is  a  powerful  method  of  accelerating  a  single  bias  point  device  simulation 
and  of  overcoming  the  inherent  memory  limitation  of  3D  simulations.  The  basic  idea  of 
implementing  this  approach  on  MIMD  is  illustrated  in  Fig.  4  where  the  spatial  device 
decomposition  of  a  FEl'  and  its  corresponding  grid  partition  is  sketched. 


Gate  L . ;l  Buffar  [  I  Subatrale 


(a)  (b) 

Fig.  4  Spatial  device  decomposition,  (a)  physical  representation  (b)  grid  partitioning 


The  solution  domain  of  the  device  being  simulated  is  spread  over  a  large  number  of  processors 
(Hg.  4.  (a)).  If  the  solution  domain  is  topological  rectangular  (as  in  H2F  [5])  the  best 
cm^guration  is  an  array  of  NxM  processors.  In  this  case  each  processor  is  allocate  to  the  grid 
points  corresponding  to  one  device's  subdomain  plus  the  grid  points  from  the  boundary  of  the 
nei^bouring  processors  subdomains  (Fig.  4  (b)). 


Fig.  5  Implementation  of  a  black/red  SOR  Fig.  6  Speed-up  of  the  parallel  SOR  solver  for 
solver.  Black  (■)  and  red  (•)  nodes  an  array  and  pipeline  of  49  processors: 

are  updated  simultaneously  _ theory,  □  •  measurements 
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Fig.  7  Potential  distribution  normal  to  the  Fig.  8  Potential  distribution  around  two 
channel  of  an  IPGT  with  gate  aluminium  wires  on  etched  p-Si 

isolatimi  created  by  ion  implantation  pedestals  for  design  of  Coulomb 

blockade  devices  [8] 

To  ensure  portability,  our  parallel  simulation  system  is  divided  into  two  parts:  a  system 
dependent  communication  harness  and  a  system  independent  simulation  harness.  The 
developed  communication  harness  for  the  Parsytec,  GARH  (General  ARray  Harness)  provides 
all  global  and  local  communications  needed  to  implementat  the  device  simulation  code  on  an 
array  of  transputers.  The  grid  generation  is  performed  on  the  'root'  transputer,  then  the  device 
is  partitioned  and  distributed  over  the  complete  array.  All  processors  perform  Uie  data 
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initialisation  and  discretization  in  parallel.  The  key  point  is  the  parallel  solution  of  the  algebraic 
system,  arising  from  the  discretization  of  Poisson's  and  the  current  continuity  equations.  For 
the  discretized  Poisson's  equation,  a  black/red  SOR  solver  was  implemented  (Fig.  S). 
Applying  a  recently  developed  performance  theory  for  the  speed-up  of  the  transputer  array 
based  iterative  solvers  [6],  the  experimentally  observed  and  predicted  speed-up  is  given  in  Fig. 
6.  This  Hgure  clearly  indicates  that  the  array  implementation  minimises  the  ratio  between  the 
toundary  and  bulk  subdomain  grid  points  and  hence  the  local  communications  overhead  is 
superior  to  the  pipeline  implementation.  The  development  of  a  processor  array  current 
continuity  solver  is  in  progress. 

Two  examples  of  parallel  simulations  of  nanostructure  devices,  based  only  on  the  parallel 
solution  of  Poisson  equation  are  given  in  Fig.  7  and  8.  The  first  figure  represents  the  potential 
distribution  normal  to  the  channel  of  an  InPlane  Gate  Transi.stor  (IPGT)  [7]  with  a  gate 
isolation  created  by  ion  damaging.  The  second  figure  represents  the  calculation  of  the  capaciuve 
coupling  between  two  aluminium  wires  on  the  top  of  an  etched  p-Si  substrate  to  aid  the  design 
of  Coulomb  blockade  devices  [8]. 


IV.  Conclusions 

Two  unified,  scalable  and  portable  approaches  have  been  developed  for  the  intensive  simulation 
of  semiconductor  devices  on  medium-sized  MIMD  systems.  The  Concurrent  Device  Simulation 
approach  is  simple  to  implement  but  leads  to  a  reasonable  efficiency  only  when  numerous  sets 
of  input  data  are  investigated  in  the  process  of  device  design  and  optimisation.  The  Spatial 
Device  Decomposition  approach  is  more  complicated  but  can  accelerate  a  single  point 
simulation.  However,  the  successful  implementation  of  the  second  approach  depends  on  the 
development  of  adequate,  effective,  scalable  and  portable  linear  equation  solvers  capable  of 
dealing  with  the  difficulties  in  solving,  for  example,  the  discretized  current  continuity  equation. 
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Summary 

A  parallel  implementation  of  the  numerical  simulation  of  a  GaAs  MESFET  device  using  the 
finite  difference  discretisation  scheme  and  solved  by  a  point  iterative  method  is  presented. 
Parallel  techniques  targeted  at  Multiple-Instructions  Multiple-Data  (MIMD)  message-passing 
distributed  memory  architectures,  in  particular.  Transputers,  are  described.  Efficient 
parallelism  is  achieved  by  the  geometric  decomposition  of  the  problem  domain.  Issues  on  the 
convergence  and  efficiency  of  the  solution  with  novel  strategies  such  as  iteration  ordering 
techniques  and  communication  protocols  in  a  parallel  environment  are  discussed.  The  parallel 
implementation  of  the  adaptive  grid  refinement  also  requires  load  balancing  techniques  for 
optimum  efficiency.  Performance  results  of  the  paraillel  simulation  using  these  techniques  are 
presented. 


1.  Introduction 

Physical  models  are  now  widely  used  for  simulating  complex  semiconductor  devices.  The 
increasing  complexity  of  the  device  models  requires  high  performance  computers  particularly 
for  interactive  device  characterisation.  With  the  advent  of  relatively  cheap  parallelism  in  the 
form  of  Transputers  [1],  a  building  block  for  a  multiprocessor  MIMD  Distributed  Memory 
Parallel  System,  the  computational  requirements  for  fast  device  characterisation  can 
theoretically  be  met.  Parallel  processing  offers  attractive  advantages  such  as  scalable 
performance  and  superior  cost/performance  ratio.  While  parallel  hardware  is  readily  available, 
there  still  is  a  need  for  robust  and  efficient  parallel  software  for  semiconductor  device 
simulation. 


2.  The  Device  Simulation  Problem 

This  work  covers  the  numerical  simulation  of  a  typical  n-channel  Metal  Semiconductor  Field- 
Effect  Transistor  (MESFET)  using  a  physical  modelling  approach.  A  simplified  Drift  / 
Diffusion  Transport  model  with  the  Scharfetter-Gummel  formulation  for  current  density  has 
been  used.  The  numerical  solution  was  achieved  using  the  finite  difference  discretisation 
scheme.  The  basic  semiconductor  equations  consisting  of  the  closely  coupled  poisson  and 
continuity  equations  are  solved  by  the  Gauss-Seidel  point  iteration  method  with  successive 
relaxation  for  time-dependent  solution  [2].  An  adaptive  refinement  strategy  is  implemented  for 
numerical  accuracy  and  optimising  computer  resources. 
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3.  The  Parallel  Solution  •  System  and  Algorithms 

The  parallel  system  used  consists  of  an  array  of  16  TRAnsputer  Modules  (TRAMs)  which 
forms  a  General  Purpose  MIMD  [3]  (GP-MIMD)  message-passing  distributed  memory  parallel 
system.  A  high  level  portable  language.  Parallel  ANSI  C,  was  used  to  code  the  algorithms. 
The  device  simulation  problem  has  been  parallelised  using  a  one  dimensional  geometric 
decomposition  [4].  Each  processor  is  assigned  a  slice  of  the  domain  which  is  stored  locally  on 
each  TRAM  module's  memory  (e  g.  Figure  1  shows  4  transputer  modules  and  4  subdomains). 
This  is  easily  scalable  to  n  transputers  where  n  is  less  than  the  no  of  grid  points  horizontally. 


Oiain  Gae  Soiree 
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TRMio  mmi  TIM1112 


Figure  1.  Parallel  Decomposition  of  the  Device  Simulation  Problem 

The  parallel  system  configuration  of  both  hardware  and  software  and  associated 
communication  channels  is  shown  in  figure  2.  The  transputers  are  connected  in  a  ring  network 
topology.  A  graphics  process  enables  the  real-time  visualisation  of  the  solution  . 


Figure  2.  Parallel  Hardware  and  Software  Configuration 


We  now  introduce  the  major  overheads  in  the  parallel  simulator  as  opposed  to  a  sequential 
implementation.  The  slave  processes  are  controlled  via  channels  1  &  2  (control  protocol)  by  a 
driver  process.  The  latter  instructs  the  slaves  to  perform  specific  task  and  monitors  the  state  of 
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the  simulation  such  as  the  gfobal  convergence  of  the  iterative  solvers  and  refinement  decisions. 
The  five-point  formula  used  for  solving  the  poisson  and  continuity  equations  requires  four 
neighbouring  data.  HoAvever,  at  subdomain  boundaries,  a  column  of  node  data  is  missing  and 
has  to  be  transmitted  via  channels  0  &  3  (data  exchange  protocol). 

The  decomposition  of  the  MESFET  domain  introduces  convergence  and  stability  problems  of 
the  numerical  solution.  For  the  poisson  and  continuity  iterative  solver,  the  update  ordering  is 
modified  when  domain  is  partitioned.  A  novel  red/black  checker-board  updating  technique 
named  as  the  SOR  ID  [S]  partitioning  method  that  ensures  the  stability  and  provides 
optimal  convergence  of  the  numerical  solution  was  developed. 


Synchronous  Asynchronous 


Figure  3.  Data  Exchange  Contmunication  Protocols 


Data  exchange  communication  protocol  which  ensures  data  consistency  across  boundaries  can 
broadly  be  categorised  as  synchronous  and  asynchronous  [6].  Figure  3  depicts  the  two  types 
of  protocols  implemented.  Synchronous  communication  ensures  that  iteration  processes  are 
synchronised  and  boundary  data  are  updated  in  a  orderly  fashion.  Asynchronous 
communications,  on  the  other  hand,  enable  the  communication  processes  (TXP  &  RXP)  to  be 
completely  independent  of  the  computation  process  (ITP).  This  protocol  provides  less 
synchronisation  overheads  but  data  consistency  at  each  iteration  cannot  be  predicted. 


4.  Parallel  Adaptive  Meshing  and  Load  Balancing 

A  refinement  algorithm  based  on  the  potential  difference  criteria  was  implemented  on  the 
parallel  simulator.  The  dynamic  refinement  of  the  domain  results  in  an  imbalance  of  work 
loads  on  each  processor.  Load  balancing  techniques  based  on  a  quasi-dynamic  [7]  strategy 
were  designed  to  achieve  optimum  performance.  Figure  4  shows  the  allocation  of  the  sub- 
domains  when  a  network  of  4  transputers  has  been  load-balanced. 
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Figure  4.  Adaptive  meshing  and  Load  Balancing 
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5.  Results  and  Performance 


These  results  and  performance  data  were  obtained  from  the  parallel  system  by  emulating  a 
GaAsMESFET  with  0.  ISp.m  active  channel  (I.5e23  m*^)  and  O.S5|im  gate  length  using  dynamic 
adaptive  grid  meshing.  The  solution  for  the  electron  concentration  profile  is  shown  in  figure  5 
for  a  bias  condition  shown  in  figure  4 


Figure  5.  Electron  Concentration  Profile  for  GaAs  MESFET 

An  overview  of  the  overall  performance  of  the  parallel  system  is  shovm  in  figure  6.  The  drop 
in  speed  up  is  attributed  to  the  communication  overheads  in  the  parallel  system.  The  graph 
shows  the  parallel  performance  of  the  solution  for  a  fixed  grid  of  128x32  A  speed  increase  of 
up  to  a  factor  of  12.5  for  synchronous  protocol  on  a  16  transputer  network  has  been  achieved. 
For  the  asynchronous  protocol  depicted  in  figure  3,  lower  performance  is  observed  due  to 
excessive  communication  overheads. 
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Figure  6.  Overall  Parallel  Performance  Results  for  fixed  grid 

Figure  7  shows  the  parallel  performance  for  the  parallel  adaptive  meshing  algorithm  of  the 
GaAsMESFET  device  (from  figure  4  &  5)  starting  with  a  grid  of  41x10  to  62x14  over  a  time 
of  SOfs.  Due  to  the  small  grid  size,  the  communication  overheads  are  significant  leading  to 


decreasing  performance  as  more  processors  are  used  The  importance  of  dynamic  load 
balancing  is  clearly  shown  from  these  results  (using  synchronous  and  RB  SOR  ID  techniques). 
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Figure  7.  Overall  Parallel  Performance  Results  for  Adaptive  Grid 


6.  Conclusion 

A  parallel  numerical  solver  for  an  iterative  GaAs  MESFET  device  solver  has  been  presented. 
Geometric  domain  decomposition  proves  to  be  the  natural  way  of  parallelising  the  simulation 
enabling  a  logical  map  on  transputer-based  systems.  Combined  with  Parallel  C,  this  provides  a 
good  platform  for  developing  parallel  algorithms.  However  the  overall  code  complexity  and 
size  increases  in  the  parallel  implementation.  The  convergence  and  stability  behaviour  of  the 
solution  is  also  found  to  be  affected  in  a  parallel  environment.  This  led  to  the  development  of 
suitable  iteration  update  ordering  techniques  which  ensure  fast  convergence  and  stability  of  the 
solution.  Communication  overheads  remain  the  major  factor  limiting  the  efficiency  of  the 
parallel  simulator.  The  parallel  grid  refinement  provided  very  efficient  computation,  with  the 
optimum  grid  structure  automatically  generated  but  requires  load  balancing  techniques  to 
improve  parallel  efficiency.  Overall,  the  performance  of  the  parallel  solver  is  scalable  for 
increasing  problem  size  and  numerical  complexity.  It  is  envisaged  that  the  parallel  techniques 
developed  will  have  a  wide  application  on  exceptionally  computer  intensive  semiconductor 
device  simulation  as  well  as  on  fast/real-time  device  characterisation. 
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Abstract 

In  this  paper  we  present  four  different  SOR  variants  for  solving  the  3D  Poisson  equation  on 
an  array  of  transputers.  The  performance  of  all  variants  has  been  tested  and  compared  in 
terms  of  number  of  iterations  and  global  computation  time  for  different  configurations  of  the 
transputer  system.  The  parallel  performance  of  the  algorithms  has  also  been  evaluated  and 
compared  to  a  theoreticaJ  speed-up  model. 


1.  Introduction 

The  realistic  semiconductor  device  simulation  (both  classical,  Monte  Carlo  or  quantum 
mechanical)  in  many  cases  requires  a  3D  solution  of  the  Poisson  equation  and  leads  to 
enormous  problem  sizes  [1].  The  single  processor  implementation  of  the  corresponding  3D 
codes  is  limited  by  both  the  processor  speed  and  the  huge  memory-access  bottleneck.  In  the 
foreseeable  future  a  significant  low  cost  improvement  in  computer  performance  will  only  be 
available  through  Multiple  Instruction  Multiple  Data  (MIMD)  systems  (many  of  them 
transputer  based),  for  which  the  necessary  speed-up  derives  from  the  use  of  parallel 
processors  sharing  a  large  distributed  memory.  The  point  and  block  Successive  Over 
Relaxation  (SOR)  methods  are  promising  candidates  for  3D  parallel  implementation  on  such 
computers.  Although  the  recursive  character  of  the  original  SOR  method  seems  to  be  a 
serious  impediment  [2],  for  a  large  class  of  linear  systems  arising  from  finite  difference,  and 
in  particular  cases  [3]  from  finite  element  approximations  of  the  Poisson  equation,  the 
multicolour  ordering  of  the  grid  points  leads  to  easily  parallelizable  versions  of  the  SOR 
method  [4,5]. 

Here  we  present  a  systematic  approach  to  the  parallel  implementation  of  scalable  point  and 
block  black/red  3D  SOR  Poisson  solvers  on  a  2D  arrays  of  transputers  for  the  purposes  of 
semiconductor  device  simulation.  Utilising  the  power  and  the  flexibility  of  the  Parsytec 
Supercluster  Model  64  a  wide  range  of  experiments  have  been  made  both  to  compare  the 
performance  of  the  different  SOR  variants  and  to  choose  the  optimum  2D  transputer 
configuration  for  mapping  3D  problems.  The  recently  developed  detailed  performance  theory 
[6]  has  been  applied  to  underpin  the  experimental  solver  design. 


IL  Model  Problem  and  Partition 


(1) 


The  Poisson  equation  used  in  the  semiconductor  device  simulations 

p 

dx^  dy^  dz^  £ 
may  be  nonlinear  or  linear,  depending  on  whether  the  charge  density  p  is  a  function  of  the 

electrostatic  potential  *F  or  not  In  the  drift-diffusion  and  the  hydrodynamic  simulation 
approaches  it  is  mainly  used  in  a  nonlinear  form  .  In  Monte  Carlo  and  quantum  mechanical 
simulation  however  it  is  usually  enough  to  solve  the  linear  Poisson  equation. 
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In  this  study  for  simplicity  and  clarity  we  consider  the  linear  3D  Poisson  equation  in  a  unit 
cube  Q  with  Dirichlet  boundary  conditions.  A  uniform  grid  is  used  to  discretize  the  region  Q 
into  12b.  The  approximation  of  the  second-order  derivatives  by  the  second-order  accurate 
central  differences  on  12h  leads  to  a  set  of  nxnxn  algebraic  equations 


Puj. 


(2) 


for  ij,k  s  2,3,  ...n-l.  where  n  is  the  number  of  points  on  a  side  of  Qb,  and  h  is  a  coefficient 
depending  on  n..  In  most  of  the  following  experiments  yrwas  set  to  one,  p  was  set  to  zero  and 
the  initial  conditions  were  vr  =  0. 


Contieti  BB  Cap  i  i  Aciiva  layar 
Qata  t,  ■■  il  BuKar  '.,1  Subatiata 


Slice  (1x5) 


Reciaii(ular  (2x5) 


Square  (5x5) 


Fig.  1  Mapping  of  a  3D  semiconductor  device  Fig.  2  Partition  of  a  2D  discretization  grid  on 
simulation  domain  on  an  2D  array  of  a  2D  array  of  transputers 

transputers 

Although  it  is  intuitively  clear  that  the  best  environment  for  solving  topologically  rectangular 
3D  problems  is  a  3D  array  of  processors,  we  are  restrict  to  a  2D  array  of  NxM  transputers. 
This  reflects  the  connectivity  of  transputers,  which  have  only  four  linlu,  and  offers  a  simple 
way  of  mapping  a  topologically  rectangular  3D  grid,  where  the  overlap  of  subdomain 
boundaries  is  essential.  The  transputers  have  a  natural  ordering  p  =  1,2,...A^  and  q  =  l,2,...Af 
Fig.  1.  One  additional  ‘root’  transputer  performs  all  management  and  synchronisation.  The 
grid  is  partitioned  into  NxM  subdomains  along  two  of  the  spatial  dimensions  (i  j)  and  each  of 
die  subdomains  involves  all  corresponding  points  in  the  third  dimension  k..  Examples  of  slice 
(IxAf),  rectangular  {NtiM)  and  square  {NxN)  partitions  are  given  in  Fig.  2.  A  universal 
communication  harness  GARH  [6]  supports  all  necessary  global  and  local  communications. 


in.  Back/Red  SOR  Variants 

The  implementation  of  the  Point  SOR  method  with  Natural  ordering  in  ij  directions  on  each 
processor  may  in  many  cases  cause  divergence  for  problems  which  behave  well  in  a  serial 
implementation  [7].  This  may  be  avoided  if  a  blacl^red  SOR  variant  is  considered.  In  this 

approach  each  partition  sub  domain  is  decomposed  into  two  further  subdomains  -  black 
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and  red  ■  The  nodes  in  the  black  and  red  subdomains  are  updated 

simultaneously  and  the  overlapping  boundaries  are  exchanged  before  each  updating. 

When  a  2D  array  of  processors  is  used,  it  is  enough  to  apply  the  black/red  ordering  only  in  the 
ij  plane  which  leaves  some  degree  of  freedom  in  the  i  direction.  To  explore  this  freedom  we 
consider  four  different  parallel  SOR  schemes  namely: 

PSORBRyNk  •  Point  SOR  with  Black/Red  ordering  in  ij  direction  and  Natural  ordering  in  k 
direction,  ^ig.  3  a) 

PSORBRyk  -  Point  SOR  with  Black/Red  ordering  in  all  three  directions  (Fig.  3  b). 

PSORBRyAk  -  Point  SOR  with  Black/Red  ordering  in  ij  direction  and  Alternating  directions 
in  k  direction. 

BSORB^Tk  -  Block  SOR  with  Black/Red  ordering  in  ij  direction  and  tridiagonal  equations 
solution  in  k  direction. 


i 


Table  2 


Mesh  size  n 

■BSiXS 

■ones 

10 

0.087 

0.093 

0.087 

0.087 

0.106 

0.077 

20 

0.635 

0.635 

0.6.35 

0..595 

0.677 

0..552 

30 

2.184 

2.184 

2.184 

1.998 

2.346 

1.985 

40 

■KESIBH 

5.539 

5.4.50 

5.003 

5.888 

4.828 

50 

11.278 

11.582 

11.278 

10.668 

12.163 

10.169 

The  main  disadvantage  of  the  black/red  ordering  wdth  all  variants  described  above  is  that  the 
natural  ordering  SOR  theory  [7]  does  not  hold,  which  makes  it  difficult  to  estimate  the 

optimum  relaxation  coefficient  (O  a  priory.  In  order  to  compare  more  precisely  our  four 

different  SOR  variants,  experimental  optimum  values  for  ov  have  been  found  by  using  a  one 
dimensional  minimum  search  based  on  the  Golden  Section  method.  In  Table  1  the  numbers  of 
iterations  providing  an  accuracy  5=0.001  are  given  for  both  (Oe  and  the  theoretically  predicted 


Table  1 


Mesh  size  n 

PSORBR 

PSORBRN 

PSORBRA 

PSORBRT 

10 

16 

15 

15 

15 

11 

20 

32 

32 

32 

30 

22 

30 

47 

47 

47 

43 

33 

40 

62 

61 

61 

.56 

41 

76 

74 

74 

70 

51 

Fig.  3  Black/red  ordering  in  the  3D  case, 
(a)  full  black/red  ordering  (b) 
black/red  ordering  only  in  i,j  plane 
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relaxation  coefficients  (Ob  [7].  As  can  be  expected  the  block  variant  BSORBRT  leads  to  a 
significant  reduction  in  the  number  of  iterations  but  at  the  expenses  of  an  increasing  number 
of  calculations  per  grid  point.  This  point  becomes  clear  from  Table.  2  where  the  execution 
times  in  seconds  are  compared  when  all  64  transputers  of  a  Parsytec  Super  Cluster  were  used, 
conHgured  in  an  8x8  array. 


IV  Speed-up  analysis 

To  evaluate  the  parallel  potential  of  the  considered  SOR  variant  and  the  optimal  processor 
conflguration  we  use  the  relative  speed-up  defined  as  the  ratio  between  execution  times  on  an 
array  of  processor  and  the  execution  time  on  a  single  processor.  It  is  obvious  that  according  to 
this  definition  the  maximal  available  speed-up  is  equal  to  the  number  of  processors  on  which 
the  algorithm  runs.  The  speed-up  model  developed  in  [6]  for  scalable  2D  linear  solvers 
implemented  on  an  array  of  transputers  was  extended  to  the  3D  case  and  takes  into  account 
the  details  of  the  solution  domain  partition,  all  global  and  local  communication  overheads  and 
the  computation  time  in  the  linear  and  nonlinear  case.  All  parameters  of  the  performance 
theory  have  been  extracted  from  independent  measurements. 
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Fig.  4  Speed-up  as  a  function  of  the  problem 
size  for  different  configurations  of  64 
transputers  (PSORBRN  variant) 


Fig.  5  Balance  between  calculation  time  and 
local  communication  time  in  the  case 
of  8x8  transputers  (PSORBRN) 


A 


The  measured  and  calculated  speed-up  for  three  different  configurations  of  64  transputers  is 
given  in  Fig.  4  for  the  PSORBRN  variant  as  a  function  of  the  problem  size.  The  picture  shows 
very  good  agreement  between  the  measured  and  predicted  performance  even  in  the  finest 
details.  The  square  partition  which  minimises  the  local  communications  shows  the  best 
performance  for  problem  sizes  n  which  are  divisible  by  the  transputer  array  size.  Fig.  5  gives 
an  idea  for  the  balance  between  the  local  communication  and  calculation  time  for  a  single 
iteration  in  the  8x8  case.  Finally  Fig.  8  illustrates  how  the  performance  theory  may  be  used  to 
estimate  the  expected  sp^d  up  for  a  particular  problem  size  (n=S0)  mapped  on  a  large 
number  of  transputers.  It  is  clear  that  for  large  transputer  systems  the  BSORBRT  variant  is 
superior  to  all  other  variants  because  of  the  higher  calculations/  communications  ratio. 
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Number  of  transputers  NN 


Fig.  6  Speed-up  of  50x50x50  problem  on  a 
large  array  of  transputers 


V.  Conclusions 

Four  variants  of  a  black/red  SOR  method  for  solving  the  3D  Poisson  equation  on  array  of 
transputers  have  been  implemented  and  tested.  The  behaviour  of  PSORBR  and  PSORBRN 
variants  ate  very  similar  in  terms  of  iterations  required  to  achieve  a  certain  accuracy.  The 
third  point  variant  PSORBRA  slightly  reduces  the  number  of  iterations.  The  block 
BSORBRT  variant  reduces  the  number  of  iteration  by  more  than  25%  but  at  the  expense  of  a 
larger  calculation  time  per  iteration.  Although  for  a  medium  size  transputer  system  all  four 
variants  are  very  similar  in  terms  of  global  computational  time,  our  speed-up  analysis  shows 
that  on  a  large  array  of  transputers  the  advantages  of  BSORBRT  will  become  more 
pronounced. 
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Abstract 

A  first  principles  LDA  approach  based  on  the  Harris  functional  ansatz  is  used  to  in¬ 
vestigate  the  electron-phonon  interaction  in  silicon,  within  the  rigid  ion  approximation. 
Electron-phonon  matrix  elements  for  transitions  between  selected  electronic  states  are 
calculated,  and  used  to  generate  band-  and  wavevector-dependent  scattering  rates.  These 
scattering  rates  are  gmerally  larger  than  those  calculated  with  other  methods,  and  show 
significant  anisotropy. 

1.  Introduction 

The  concept  of  applying  pseudopotential .  to  the  analysis  of  electron-phonon  interac¬ 
tions  in  solids  [1]  has  served  as  an  invaluable  tool  in  our  theoretical  understanding  of 
the  process.  Without  pseudopotentials,  one  currently  either  relies  on  a  restricted  scope 
of  measurable  information,  or  is  relegated  to  phenomenology.  Primarily  for  purposes  of 
simplicity,  most  investigations  of  the  electron-phonon  interaction  in  semiconductors  have 
made  use  of  empirical  pseudopotentials  [2-6],  mostly  in  the  local  approximation  [2-5]. 
As  the  true  crystalline  pseudopotential  is  necessarily  nonlocal,  some  authors  have  added 
correction  terms  [5,6].  While  useful  in  gaining  insight,  these  empirical  ^proaches  suffer 
from  their  sensitivity  to  several  adjustable  parameters  as  some  authors  have  noted  [5,  7]. 
For  example,  before  any  electron-phonon  analysis  takes  place,  all  of  the  local  form  '^-'ctors 
must  be  treated  as  free  parameters  and  adjusted  such  that  the  resulting  bandstructure 
matches  experimental  values  at  a  suitably  defined  set  of  points  in  the  Brillouin  zone  [8]. 
Additionally,  due  to  the  nature  of  the  electron-phonon  interaction  potential,  one  must 
still  choose  among  various  interpolation  and  extrapolation  schemes  between  and  beyond 
these  discrete  adjustable  form  factors  in  reciprocal  space.  Of  particular  concern  is  the 
finding  that  the  most  physically  meaningful  choice  of  extrapolation  criteria  do  not  yield 
the  best  results  [5].  For  these  and  other  reasons,  several  authors  have  chosen  an  alterna¬ 
tive  approach. 

It  has  long  been  known  [9]  that  electron-phonon  matrix  elenents  calculated  using  pseu¬ 
dopotentials  can  in  principle  be  just  as  accurate  as  those  cai-nlr-  ted  using  the  true  poten¬ 
tial,  as  long  as  the  pseudopotentials  are  calculated  directly  from  the  true  potential.  This 
has  led  to  several  oh  initio  investigations  based  on  denrity  functional  theory  [10, 11, 12]  in 
the  local  density  ^>proximation  (LDA).  Such  self-consistent  Kohn-Sham  (K-S)  type  cal¬ 
culations  quite  naturally  avoid  all  of  the  difficulties  of  the  empirical  approach  mentioned 
above.  FHirthermore,  the  self-consistent  LDA  calctilations  implicitly  include  the  effects  of 
atomic  polarization,  which  is  impossible  to  include  using  the  empirical  pseudopotenials. 
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With  all  of  these  benefits,  the  K-S  method  has  the  drawback  that  modem  computational 
resources  limit  its  i4>plication  to  a  small  set  of  high-symmetry  transitions  due  to  the  re¬ 
quirement  of  self-consistency. 

An  alternative  approach  [7]  which  uses  the  ab  initio  pseudopotentials  yet  avoids  the  prob¬ 
lem  of  self-consistency  involves  an  approximation  to  the  change  in  crystalline  charge  doi- 
sity  when  the  atoms  are  displaced.  This  approximation,  based  on  the  Harris  functional 
ansatz  [13],  is  to  replace  the  self-consistent  K-S  potential  with  one  derived  from  overlap¬ 
ping  atomic  charge  densities.  The  benefit  of  this  approach  is  the  flexibility  to  calculate 
electron-phonon  matrix  elements  for  arbitrary  transitions,  yet  it  remains  a  first-principles 
theory.  In  the  following  sections,  the  method  is  briefly  described,  and  then  used  to  gen¬ 
erate  band-  and  wavevector-dependent  scattering  rates  for  silicon. 

n.  The  Model 

One  may  write  a  crystalline  density  functional  non-interacting  Hamiltonian  using  pseu¬ 
dopotentials  as 

»  =  -5^’  +  E  +  Wr)  +  »'hI»(>-)1  +  V„|n(r)l  +  V„|n(r)l,  (1) 

where  V^*,/oe(r)  is  the  local  crystalline  ionic  pseudopotential,  A VjjJ,  „^(r,  r')  is  the  nonlo¬ 
cal  1-dependent  ionic  crystalline  pseudopotential,  and  VH[n(r)]  is  the  Hartree  potential. 
Ks[n(r)]  and  V^[n(r)]  are  functional  derivatives  of  the  exchange  and  correlation  energies 
with  respect  to  pseudocharge  density.  Single  electron  wavefunctions  are  expanded  in  a 
plane  wave  basis  as 

E  (2) 

and  matrix  elements  of  this  pseudopotential  Hamiltonian  are  taken  with  respect  to  these 
basis  functions  leading  to  the  familiar  secular  equation 

det  |(|(k  -I-  G')’  -  Ey,n)6G>,G  +  +  G,  k  -|-  G')|  =  0  (3) 

which  defines  the  eigen-energies  and  wavefunctions. 

As  is  usual,  the  bare  ionic  potential  (nucleus  with  core  electrons)  is  expanded  in  a  series 
under  the  assumption  of  small  displacements, 

V»(r)  =  E  W('  -  «)  -  ’■«)  +  -  Ry  -  T,)  +  ... )  ,  (4) 

ignoring  multi-phonon  processes  [14]  and  any  polarization  of  the  bare  ions  themselves 
under  displacement.  Since  the  first  order  term  is  much  smaller  than  the  zero  order  term, 
the  electron-phonon  interaction  is  treated  perturbatively,  with  the  screened.. first  order 
term  gi  ing  rise  to  transitions  between  eigenstates  of  the  screened  zero  order  term.  In  the 
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Fig.  1.  Total  electron-phonon  scattering  rate  as  a  function  of  energy.  Rates  are  into 
states  in  the  first  4  conduction  bands  out  of  the  first  two. 

adiabatic  approxinoiation,  this  is  expressed  as: 

V.,-,A(r)  =  =  (5) 

with  transform  [14] 

Ve,-pfc(q  G)  =  £-»(q  -I-  G,  q  -I-  G^ a;  =  0) V.,-p*^(q  -H  G'),  (6) 

G' 

defining 

Kt-^(r)  =  l^iR,^VK"(r-R,-r.)  (7) 

jiO 

Under  the  assumption  that  the  atonuc  pseudocharge  density  moves  rigidly  with  the  atoms 
of  the  crjrstal  lattice  under  displacement,  a  linearized  interaction  potential  reduces  eq’n 
(5)  to  the  inner  product  of  the  ionic  displacement  and  the  gradient  of  the  screened  ionic 
potential.  In  this  case,  the  calculation  of  electron-phonon  matrix  elements  proceeds  closely 
along  the  lines  of  the  standard  rigid  ion  theory,  with  only  minor  modification. 

m.  Discussion 

The  method  of  [15]  was  used  in  conjunction  with  the  Fermi  Golden  Rule  and  a  realistic 
phcmon  dispersion  to  generate  band-  and  wavevector-dependent  electron-phonon  scatter¬ 
ing  rates.  Figure  1  shows  the  total  energy-dependent  scattering  rate,  with  the  density- 
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ol-8t«t«8  included  as  a  rderence.  The  scattering  rate  is  found  to  be  luger  than  that  of 
the  empirical  method  [5],  especially  at  high  energies.  As  an  indication  of  the  anisotropy 
of  these  rates,  figure  2  depicts  an  energy  iso»'  'rface  in  the  first  conduction  band,  shaded 
proportionally  to  total  rate  at  each  point  in  the  Brillouin  zone. 
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Abstract 

Disorder-induced  scattering  due  to  the  random  distribution  of  atomic  qtecies  the  constituent 
semictxiductors  in  alloys  can  lead  to  intervalley  scattering  transitions.  A  mediod  based  on  a 
wavevector  dependant  formulation  of  the  Qtherent  Potential  Approxiroation  is  used  to 
calcuUue  the  intervalley  scattering  potentials. 


I  Introductioa 

Recent  experimental  work  of  Kalt  et  al.  [1]  indkate  that  disorder  scattering  can  have  large 
wavevector  components  of  the  scattering  potential,  which  nuy  give  rise  to  intravalley 
scattering.  Grein  et  al.  [2]  also  calcula^  the  lifetime  broadenings  of  electrons  due  to 
disorder-induced  scattering  from  the  F  valley  to  the  side  valleys  using  an  empirical 
pseudc^tcnial  formulation,  and  concluded  that  this  is  the  dominant  scattering  process  for 
electrons  in  the  F  valley  fninimmti  of  AlGaAs  with  coaq)Osition  x  >  0.4.  In  &s  study,  we 
derive  a  genraal  formiilatioo  for  treating  diagonal  disorder-induced  intervalley  scattering 
transitions  due  to  diagond  disorder  based  on  the  Coherent  Potential  Approximation  (CPA). 
Previous  formulations  of  the  CPA  for  application  to  alk^  bandstructures  average  out  tte  self^ 
energy  over  the  entire  Brillouin  zone.  In  this  work,  the  wavevector  dependence  of  the 
relevant  quantities  ate  retained.  The  scattering  po^tial  Vsc(q),  which  causes  a  scattering 
transititm  with  wavevector  q,  is  found  by  calculating  the  imaginary  part  of  the  CPA  self¬ 
energy  at  the  f  point  of  interest 


n  Coherent  FMential  Appraximatiaii 

Tlw  coherent  potential  araroximation  method  has  been  used  previously  to  treat  the  electronic 
structure  of  alloys  [3-5].  The  exact  Hamiltonian  for  an  alloy  is  written  as  die  sum  of  an  ordered 
virtual  crystal  Hamiltonian,  whose  solution  is  known,  and  a  random  potential  due  to  the 
random  distribution  of  atoms: 


Haiioy  =  Ho  +  Vnodom , 

(1) 

Vtandoin^Z  CVr~Vo). 

(2) 

r 


The  random  part  of  the  crystal  potential  is  replaced  with  an  effective  potential  (self-energy)  in 
the  CPA  approximation,  which  is  described  by  an  effective  Hamilmnian 

Heff-Ho  +  Kk).  (3) 

From  the  the  iterative  stdution  of  the  Dyson’s  equation  relating  the  Green’s  functions  for  the 
exact  alloy  Hamiltonian  and  the  effective  Hamilttxiian 
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Oiriby  "  Gaff  >  OeoCVivaan  ~  I)Galii9 

diB  aiomk  acttierins  matrix,  T,  can  be  defined  as 

T. 

“  I-(V-I)Oeff  ’ 
which  transfonns  (4)  into  the  fbnn 

GaUoy  -  + OerfrCteff . 

Taking  the  configuration  average  both  sides, 

<Galloy>  *  Gfff + Gefr*!T^eff » 


(4) 

(5) 

(6) 

(7) 


results  in  a  self-consistency  relation  for  the  self-energy  £0t)-  This  leads  to  the  condition  that 
the  ccmfiguration  average  td  the  atomic  scattering  matrix  vanish  at  every  point  in  die  Biillouin 
zone: 


^ — Ymzm — ^(1.,, — yB»)-aw — «  (» 

I-(VA«-aW)OrfKll)  I-(VB(k)-S(^))OeB(B) 

VA(k)  and  VgOt)  are  the  and-bonding  potentials  of  die  A  and  B  semiomductors,  calculated 
frcm  die  ctmduction  band  energy  solutions,  and  are  referenced  with  respect  to  die  vacuum 
level  Chen  et  al.  [3]  use  an  ^iproximation  by  which  die  cdfective  Gierers  function,  and  the 
self-ener^  term  are  diagonalized.  The  effective  Green’s  function  is  of  the  form  of  Bloch 
sums,  which  are  valid  for  ti^t-binding,  <xr  bond-orbital  basis  functions  and  result  in  die 
cancellation  of  terms  in  the  Bnllouin  zone  integration.  Then,  the  effective  Green’s  function  is 
(rfthefoRn 


GeiKk)-  I 

k' 


1 

Ek-Ek'-Ik' 


(9) 


By  discretizing  die  Biillouin  zone  intoNpoints,  we  obtain  an  system  of  coupled  nonlinear 

equations  for  Zk  to  be  solved  self-ccmsistantly.  Through  a  local  k  approximation,  we  first 
cidculate  the  density  of  states  only  at  the  k-point  of  interest,  dirough  which  the  nonlinear 
system  of  equations  can  be  decoiqil^  Rewriting  die  ef^tive  Green’s  function  as 


Geff(k) 


-  — +  Z' - ^ - 

Zk  Is  Ek-Ek'-Zk' 


(10) 


the  summation  term  is  neglected  initially.  We  have  checked  that  this  iqiproximation  is  valid, 
and  find  that  die  first  term  is  about  five  times  larger  than  the  summation  term.  The  resultant 
self  enmgies  Zg.  from  the  solution  of  (8)  are  fed  back  into  die  Green’s  function  expression  of 
(9)  and  the  iteration  is  performed  until  self-consistency  is  reached.  The  self-energies  do  not 
change  appreciably  afier  the  first  iteration. 


Hgure  1.  CPA  Self-Energy  in  the  BriUouin  zone  fcrx  «  0.2  and  0.6  in  AlGaAs.  Circles  and 
diimionds  are  the  real  and  imaginary  cooqwnents  ci  die  self-energy  for  x  »  0.2,  and  the  squares 
and  triangles  are  die  real  and  imaginary  s^-energies  forx  »  0.6. 


Using  the  semi-enmirical  dght-binding  method  to  calculate  the  eigenenergies,  the 
scalar  equations  resulting  from  me  above  qiproximations  are  solved  using  a  ^fowtoo-IUqmson 
iteration  for  X(k)  at  die  discretized  p^nts  in  die  irreducible  1/48-th  wi^ge  of  the  Brilkniin 
zone.  The  results  for  the  real  and  imaginaiy  components  of  the  self-energy  are  shown  in  1 

for  two  different  conqwsitions  of  AKlaAs.  b  is  aeea  that  the  real  part  of  die  self-energy,  which 
is  a  band-renormalization  term  in  the  virtual  crystal  approximadon,  results  in  a  correction  to  me 
band  energies  whose  magnitude  is  less  than  10  meV  throughout  die  BriUmiin  zone.  The 
imaginary  part  is  equal  to  the  scattering  mitential,  and  shows  a  structure  relamd  to  the 
syimnetn^  points  F,  L  and  X.  Between  me  A  and  K  points,  we  have  a  dqi  in  die  self-energy, 
which  is  actually  a  result  of  the  semi-eiiqnrical  tight  binding  approximatioo  to  me 
bandstmcture.  In  an  basis,  the  energy  bands  true  ture  between  the  X  and  K  points  has  a 
degeneracy  point,  wdule  in  an  Engnrical  Pseudopomntial  calculadoo  of  the  band  structure  oi 
AlGaAs,  such  a  degeneracy  point  is  not  sera.  Toe  conqxisiticm  dqiendence  of  the  imaginary 
part  of  the  self-energy  for  tte  wavevector  components  L  andX  is  shown  in  Ke.  2.  Tbmisa 
marked  deviation  from  a-vx(l-x)  type  of  be^viour,  which  is  expected  from  a  binomially 
distributed  random  atomic  energy.  Thus,  die  bowing  pararrMer  is  not  a  simple  scalar  quand^. 


m  Applic^iou 

We  qiply  our  dismder-scattering  potemial  results  to  the  simuladon  of  a  fennosecond  dme- 
resolved  purrqi  prtdie  experim^  m  intervalley  scattering  in  Alo.6GaO.4As  by  Wang  et  a/.[6]. 
The  interval^  scattering  transition  is  an  L-wavevector  transition,  whose  scattering  potential  is 
sera  to  be  18  meV  from  Hp.  1  and  2.  We  include  diis  scattering  process  in  an  Ensemble 
Monte  Cario  siirmlation  of  intervall^  scattering  in  AlGaAs.  The  dam  points  in  Hg.  3  ate  die 
X-valley  populadon  in  arintrary  mdts.  A  400  fenttosecondFWHM,  2.12  eV  laser  pdse  excites 
catrien  into  the  L  and  X  vaU^  (die  laser  pulse  is  cratered  at  r  b  1(X)0  femtoseconds  in  the 
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(solid  line). 


figure).  Pboto-exdted  canien  undergo  inter-  and  intra-valley  phonao-  and  disorder-induced 
scattering  events  to  pile  up  at  the  botiotn  of  dre  X  valley.  Tte  population  of  the  X  vall^  is 
monitoted  by  a  SOO  remtosecond  FWHM  convtdving  pulse  which  causes  an  infiared  transition 
to  the  X-^  valley.  Previously,  we  had  reported  a  parameter  fit  of  the  non-polar  optical 
deformation  potential  constant  to  be  »  U  ±  0.5  x  lOB  eV/on  [7].  The  results  of  the 
siiiBi]ationfioravalneafPLX»l.Sx  1(f  eV/cm  par  Innary  alk^  mode  are  shown  in  Bg.  3  for 
three  different  disorder-induced  intenralley  scanering  potentials.  It  can  be  seen  that  the  effect 
of  including  disorder-induced  scattering  inooesses  for  LrX  intervalley  scattering  is  small. 


IV  CoBduaioii  < 

Disorder-induced  intervalley  scattering  is  investigated  using  a  Coherent  Potential 

Approximation  to  model  the  random  potei^  in  the  all^  AKkAs.  The  imaginary  part  of  the 

self-energy  correction  is  responsible  for  scattoing  so  we  can  use  this  self-energy  to  derive 

scattering  potentials.  In  particolar,  a  k-dependant  formulation  for  the  self-energy  gives  us  the 

disorder  potential  compoiMts  directly  responsible  ftx  the  large  wavevector  scattering  < 

transitions  udiich  can  rewt  in  intervalky  transfer.  The  disostier-inthiced  intervalley  scanering 

potential  for  F-X  transitkns  (an  X-wavevector)  for  the  esqierimental  case  of  [1]  is  found  to  be 

38  meV  at  conqxisititm  x  •  0.4,  somewhat  smaller  than  the  value  reported  (S3  meV). 

However,  tins  value  seems  overestimated  from  the  detivatum  in  [1].  Intervalley  scattering 

between  the  L  and  X  valleys  has  <mly  a  weak  disonder-indiKed  scattering  potential.  For 

AlGaAs  witii  conuxmtion  x  >  0.6,  disorder-induced  scattering  processes  for  L-X  intervalley  < 

scattering  are  mall  coaqMued  with  idKMioii-indnoed  processes. 


( 
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HgureS.  Effect  of  duofder-induoed  L-X  intervaUey  scattering.  The  dotted  line,  solid  line  and 
the  dasM  line  are  for  scattering  potentials  Vk(L)  =  0. 18  meV  and  25  meV  respectively.  The 
diamonds  are  the  data  points  of  Wang  era/.  [6]. 
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Abstract 

A  semiclassical  Monte  Carlo  technique  is  employed  to  simulate  the  steady- state  electron 
transport  in  sihcon  dioxide  at  intermediate  and  high  electric  fields.  The  electronic  struc¬ 
ture  is  modelled  by  a  single  parabolic,  by  a  single  nonparabolic  as  well  as  an  isotropic 
four-band  model.  We  find  that  the  electronic  behavior  of  silicon  dioxide  is  mainly  influ¬ 
enced  by  a  single  nonparabohc  conduction-beind.  The  injection  of  electrons  into  silicon 
dioxide  is  also  investigated  in  order  to  extract  the  thermalization  length  of  electronic 
carriers. 

I.  Introduction 

Silicon  dioxide  is  of  vital  interest  for  “metal  oxide  semiconductor”  (MOS)  technology 
because  of  its  importance  as  an  insulator  for  gate  electrodes.  For  “Ultra  Large  Scale 
Integrated”  (ULSI)  circuits  the  thickness  of  the  insulating  film  is  only  comprised  of  a  few 
nanometers,  which  results  (for  typical  bias  voltages)  in  a  normal  field  approximately  of  1  - 
10  megavolts  per  centimeter  (MV/cm).  Experimental  and  theoretical  studies  give  evidence 
that  material  breakdown  will  not  occur  under  the  influence  of  applied  field-strengths  as 
high  as  20  MV/cm  [1][2].  Under  these  enormous  fields,  the  electronic  distribution  becomes 
unstable  if  the  energy  being  gained  from  the  field  can  no  longer  be  given  to  the  lattice. 
Three  different  scattering  events  are  involved  to  model  the  transport  behavior  of  electrons 
in  Si02,  namely,  polar  longitudinal  optical  (LO)  phonons,  nonpolar  optical  and  nonpolar 
acoustic  phonons  [1] [3]  [4]  [5]  [6]  [7] [8]. 

II.  The  Physical  Model 

In  low  electron  energy  levels,  polar  longitudinal  optical  phonons  are  the  dominant  scat¬ 
tering  process  in  silicon  dioxide.  The  electrons  lose  a  large  amount  of  energy  to  the  lattice 
due  to  a  strong  interaction  of  the  polar  phonon  modes  via  the  polarization  field  of  the 
ions,  but  at  high  and  intermediate  fields  they  cannot  prevent  “velocity  runaway”  or  even 
material  destruction.  Nonpolar  acoustic  phonons  force  electrons  to  scatter  and  stabilize 
the  electronic  distribution.  As  electrons  reach  the  threshold  for  the  emission  of  nonpo¬ 
lar  aco'istic  phonons,  the  probability  of  having  Bragg  reflections  (Umklapp-processes) 
increases  keeping  electronic  carriers  from  gaining  more  energy  ffom  the  fleld  [3]  [7]  [9]. 
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III.  The  Transport  Model 


In  this  section  we  briefly  describe  our  Monte  Carlo  algorithm  to  solve  the  Boltzmann 
transport  equation  (BTE).  An  excellent  review  on  this  method  is  given  in  [10].  Some  minor 
improvements  and  differences  of  this  numerical  method  will  be  briefly  reported  here.  The 
equation  of  motion  and  the  duration  of  free  flight  are  solved  simultaneously  by  employing  a 
Runge-Kutta  algorithm  instead  of  the  usual  self-scattering  scheme.  After  performing  a  free 
flight  the  scattering  process  is  randomly  chosen  according  to  the  partial  scattering  rates. 
If  one  scattering  process  is  selected  the  after-scattering  state  of  the  electron  is  calculated. 
For  nonpolar  electron-phonon  collisions  the  polar  6  and  cizimuthal  angle  (f)  are  uniformly 
distributed  [10],  whereas  the  LO  phonons  favor  large  angle  scattering.  To  compute  the 
polcir  angle  we  use  a  modified  rejection  technique  with  a  nonconstant  enveloping  function. 
The  azimuthal  angle  of  the  LO  phonons  is  chosen  randomly.  Having  calculated  the  state 
of  the  scattered  electron  we  perform  another  free  flight  till  the  maximum  number  of 
scattering  events  is  reached. 


rV.  Results 


To  model  the  band-structure  in  SiOj  we  implemented  an  isotropic  four-band  model  with 
one  nonparabolic  (nonparabolicity  a)  and  three  parabolic  bands  [11] 
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Table  1.  Parameters  of  the  four-band  model  used 

The  parameters  of  the  band-structure  are  summarized  in  Table  1.  The  data  were  extracted 
from  band  calculations  of  Chelikovsky  and  Schluter  [12]. 

In  contrast  to  the  electronic  character  of  bands  one  and  three,  the  bands  two  and  four 
show  hole-like  behavior.  The  density  of  states  (figure  1)  of  one  parabolic,  one  nonpaxabolic 
and  the  four-band  model  are  compared,  whereas  the  first  bands  have  the  same  mass.  It  is 
clearly  seen  for  intermediate  and  high  energies  that  the  density  of  states  strongly  differs. 
The  main  features  of  a  realistic  band-structure  with  two  maxima  at  5.5  eV  and  9  eV  and 
one  minimum  at  7  c  Vare  well  reproduced  and  strongly  influence  the  electronic  distribution 
at  all  energies. 
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Figure  1.  DOS  of  different  band- 
structures  used  for  SiOj. 


Figure  2.  Scattering  rates  of  different  band- 
structures  used  for  SiO: 


dcctric  flcM  [MV/on]  dcctiic  field  [MV/cm] 


Figure  3.  Drift  velocity  of  different  band-  Figure  4.  Mean  kinetic  energy  in  Si02  corn- 
structures  in  Si02-  pared  with  the  data  of  Ref.  7. 

Figure  2  presents  the  total  scattering  rate  for  a  temperature  of  300  K  and  compares  it 
with  a  single  conduction-band.  Nonpolar  acoustic  phonons  set  in  at  about  2.75  cF  as  the 
dominant  scattering  process  (U-process).  Compared  with  one-band  models  the  different 
character  of  the  four-band  model  again  results  in  two  maxima  and  one  minimum.  The 
discontinuity  at  the  peak  reflects  the  intravalley  character  of  U-processes. 

The  dependence  of  the  drift  velocity  of  electronic  carriers  in  Si02  versus  electric  field  is 
plotted  in  figure  3.  It  increases  till  U-processes  occur.  A  parabohc  one-band  model  tends 
to  lower  velocities,  whereas  nonparabolicity  increases  the  velocity.  The  split-off  between 
the  nonparabolic  band-model  and  the  four-band  model  is  caused  by  a  non-negligible  occu¬ 
pancy  of  the  second  band  at  high  electric  fields.  Our  data  are  compared  with  the  results 
of  Fischetti  [7]  [13]. 

The  energy  is  plotted  in  figure  4.  We  observe  that  a  single  nonparabolic  conduction-band 
and  the  four-band  model  do  not  exhibit  any  deviation,  moreover,  they  almost  demonstrate 
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quantitative  identical  values.  Three  different  techniques  have  been  employed  to  extract  the 
energy  as  a  fimction  of  the  appUed  electric  field,  namely  the  carrier-separation  technique, 
the  electroluminiscence  method  and  finally  the  vacuum-emission  technique  [1],[2],[7]. 


Figure  5.  Spatial  energy  distribution  in  Figure  6.  Spatial  energy  distribution  in 
SiOj  for  electrons  injected  SiOa  for  electrons  injected 

with  0.1  cF.  with  I  eV. 

In  figure  5  we  investigate  the  thermalization  length  of  electrons  in  SiOo  employing  one 
nonparabohc  band.  Electrons  are  injected  at  the  left  boundcuy  according  to  an  Boltzmann 
distribution  with  an  average  energy  of  0.01  eV.  We  find  that  the  thermalization  length  is 
dependent  on  the  applied  electric  field  as  well  as  on  the  energy  of  the  injected  electrons.  At 
low  field-strengths  tae  distribution  shows  that  the  electrons  requires  an  average  distance  of 
about  30  nm  to  obtain  the  mean  kinetic  energy,  whereas  at  high  fields  the  average  distance 
to  thermcilize  is  obviously  lower  than  10  nm.  If  the  average  energy  of  the  applied  field  does 
not  reach  the  threshold  of  U-processes  the  scattering  rate  is  rather  low  and  the  mean  free 
path  large.  Therefore  scattering  events  are  rarely  resulting  in  long  thermalization  lengths. 
For  high  fields  the  mecin  free  path  is  short  eind  U-processes  are  dominant  favoring  large- 
angle  scattering,  which  thermalize  the  carriers  within  few  nm. 

Figure  6  presents  the  distribution  of  electrons  in  Si02  that  are  injected  with  1  eU  in 
average.  Again,  we  observe  that  the  thermalization  length  of  electrons  is  dependent  on 
the  electric  field,  but  shorter  than  for  carriers  injected  with  low  energy.  The  average 
length  for  electrons  to  thermalize  is  lower  than  15  nm.  For  high  fields  it  is  clearly  seen 
that  electrons  thermalize  within  10  nm.  In  contrast  to  injected  electrons  with  low  energy 
the  probability  of  suffering  U-processes  is  high  leading  to  a  short  mean  free  path  and  a 
short  thermalization  length. 
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Abatrmct 

Monte  Carlo  particle  metboda  have  a  nniqne  role  in  sexnicondnctor  device  simulation,  since 
they  allow  one  to  s<dve  the  Boltimann  equation  statistically,  with  the  inclusion  of  many 
physical  details  which  cannot  yet  be  included  completdy  in  other  approaches.  Of  the  main 
limiting  drawbacks  of  the  teclmique,  memory  requirements  and  computational  costs  are 
much  alleviated  by  the  increasing  power  of  computers.  The  problems  due  to  statistical 
noise  can  in  some  applications  be  corrected  by  ad  hoc  techniques.  This  paper  briefly 
reviews  the  state-of-the-art  of  Monte  Carlo  device  simulation  and  elaborates  on  the  future 
applications  of  the  method. 


I.  Introduction 

Monte  Carlo  particle  simulation  has  been  a  powerful  tool  for  the  investigation  of  transport 
in  semiconductors  for  well  over  twenty  years.  The  evolution  of  Monte  Carlo  methods 
has  been  directly  influenced  by  advances  in  computers,  which  have  made  possible  the 
implementation  of  more  and  more  refined  physical  models  [1,2].  Originally,  applications 
were  limited  to  tracking  the  evolution  of  a  single  particle,  to  obtain  steady-state  time- 
averages  of  transport  parameters.  The  single  partide  Monte  Carlo  technique  is  adequate 
to  study  bulk  semiconductor  properties  under  uniform  fidd  conditions,  3ddding  as  typical 
results  the  distribution  function,  average  vdodty  and  energy,  valley  occupation  percentage, 
and  vdodty-fidd  characteristics.  The  single  partide  mo<.:d  is  also  adequate  to  obtain  local 
information  in  device  structures  for  which  a  potential  distribution  is  approximatdy  known. 

T^andent  and  setfcondstent  nmulations  were  implemented  when  the  memory  of  com¬ 
puter  was  increased  to  allow  the  simultaneous  tracking  of  thousand  of  particles.  The 
so-called  ensemble  Monte  Carlo  technique  has  then  made  possible  a  whole  new  range  of 
selfconsistent  applications  which  have  required  the  indusion  of  methods  for  the  local  eval¬ 
uation  of  electronic  forces,  e.g.  Poisson’s  equation  [3].  The  early  simulation  models  have 
treated  the  bandstructure  with  an  analytical  parabolic  or  non-parabolic  approximation. 
The  energy  range  for  hot  dectron  analysis  has  been  considerably  extended  with  the  in¬ 
troduction  of  algorithms  which  implement  numerically  a  complete  bandstructure  of  the 
semiconductor  material  [2,4].  Initially  limited  to  bulk  material  and  single  particle  applica¬ 
tions,  the  full  bandstructure  Monte  Carlo  has  been  extended  in  recent  years  to  ensemble 
sdfcondstent  applications  and  can  be  now  run  &irly  eflidently  on  workstations  [4,5].  While 
hardware  improvements  are  making  Monte  Cado  applications  more  realistic,  many  efforts 
have  been  devoted  to  overcome  the  natural  limitations  of  the  technique,  to  optimize  the 
algorithms  and  to  take  advantage  of  new  hardware  c^abilities  to  introduce  more  advanced 
physical  modds. 

The  flow-chart  of  a  sdfoonsistent  Monte  Carlo  device  simulation  is  relativdy  simple. 
The  method  uses  a  time-dependent  approach,  which  besides  providing  transient  results 
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may  alto  be  run  until  a  steady-state  is  achieved.  The  iteration  oscillates  between  a  block 
which  utilises  the  information  on  charge  density  to  evaluate  the  electrical  forces  in  space, 
and  a  block  which  tracks  the  particle  movement  within  a  given  timestep.  The  frequency  of 
forces  update  is  chosen  as  a  trade-off  between  accuracy  of  the  physics  which  requires  very 
frequent  force  recalculation,  and  overall  efficiency. 

The  particle  movement  is  divided  into  two  distinct  parts:  free  ffight  under  the  influence 
of  the  electrical  forces,  and  scattering  events  that  terminate  the  flights.  The  length  of  the 
free  flight  trajectory  is  determined  statistically,  by  relating  the  total  scattering  probability 
rate  to  a  pseudo-random  number  picked  from  a  uniform  sequence  generated  by  the  com¬ 
puter.  Once  the  flight  is  terminated,  random  number  techniques  are  again  used  to  select 
the  type  of  scattering,  according  to  the  relative  rate  strength  of  the  various  mechanisms  at 
that  particular  energy,  and  to  determine  the  final  state  after  the  scattering  event. 

The  particles  are  treated  as  classical  objects  obeying  Newtonian  mechanics  during  the 
free  flights,  and  the  scattering  events  are  assumed  to  be  instantaneous.  Just  a  few  lines 
of  code  are  necessary  to  evaluate  the  momentum  evolution  during  a  timestep,  using  the 
classical  law  of  accelerated  motion.  In  an  analytical  band  formulation,  the  energy  at  the 
end  of  the  timestep  is  directly  computed  by  evaluation  of  a  simple  formula.  In  the  full 
bandstructuxe  formulation,  since  the  energy  values  are  available  only  on  a  3-D  grid  in 
the  Brillouin  sone  of  momentum  space,  the  energy  at  the  end  of  the  timestep  has  to  be 
evaluated  by  interpolation.  This  process  is  one  of  the  major  bottlenecks  in  the  simulation, 
and  innovative  gridding  approaches  have  been  recently  applied  to  implement  faster  energy 
evaluation  techniques  [6]. 

The  weights  of  the  d^erent  parts  of  the  code,  in  terms  of  the  overall  CPU  time,  vary 
according  to  the  implementation  and  running  conditions.  In  most  applications,  the  force 
evaluation  by  solving  the  Poisson’s  equation  represents  only  a  few  percents  of  the  compu¬ 
tation,  when  it  is  solved  at  typical  time  intervals  of  10  fs.  However,  in  some  cases  (e.g. 
high  doping)  the  time  between  two  Poisson  solutions  must  be  reduced  to  1  fs  or  less,  to 
avoid  spurious  plasma  oscillations  of  the  particle  gas,  and  in  2-D  or  3-D  simulations  with 
a  large  number  of  grid  pmnts,  such  numerical  solutions  may  carry  a  considerable  weight. 
When  forces  are  evaluated  bu  a  full  molecular  dynamics  approach,  the  computational  cost 
for  this  may  be  dominant,  although  there  are  multipole  techniques  which  can  be  utilized 
to  dramatically  reduce  the  CPU  time  without  sacrifidng  precision  [7,8]. 


n.  Statistical  noise 

Statistical  noise  due  to  the  randonmess  of  the  events  and  discreteness  of  the  particles 
always  affects  Monte  Carlo  simulation  results.  In  many  cases,  it  is  sufficient  to  increase  the 
number  of  particles  and  average  in  time  the  ensemble  averages,  to  improve  the  accuracy 
of  the  collected  statistics  for  various  observables.  However,  there  are  cases  where  a  mere 
increase  of  the  number  of  samples  is  not  practical.  This  is  typically  true  in  the  case  of  effects 
which  depend  on  high  energy  tails  of  the  distribution  function.  An  example  is  the  injection 
of  carriers  in  the  oodde  of  a  MOSFET  structure.  Steady-state  parameters  like  potential 
and  carrier  distribution  in  the  device  are  not  very  much  affected  by  these  rare  events,  but 
the  evaluation  of  gate  currents  is  exclusively  determined  by  them.  It  is  necessary  to  assign 
different  weights  to  particles  in  different  energy  ranges  in  order  to  emphasize  the  statistics 
of  high  energy  tails,  while  preserving  the  overall  physical  charge  for  force  evaluation. 

If  the  transient  behavior  is  of  interest,  it  is  not  possible  to  perform  time  averages 
and  only  an  increase  of  the  size  of  the  ensemble  would  improve  the  statistics,  but  again. 
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tlus  it  iddoin  practical.  Small-signal  parameters  of  microwave  devices  can  be  determined 
for  instance  by  Fourier  transform  of  tbe  transient  currents.  Deterministic  models  like 
drift-diffusion,  which  is  not  affected  by  statistical  noise,  are  frequently  used.  They  are 
also  applied  to  study  large-signal  transient  response  of  distal  circnits.  The  Monte  Carlo 
technique  can  be  applied  when  drift-diffusion  fails,  provided  that  integrated  quantities, 
rather  than  instantaneous  ones,  are  considered.  This  is  done  by  extending  an  idea  first 
introduced  by  Hockney  and  Eastwood  [2],  where  one  keeps  track  of  the  total  charge  or 
current  which  transits  through  contacts.  When  the  transient  simulation  is  carried  through 
steady-state,  it  is  possible  to  identify  two  contributions  to  the  cumulative  charge,  associated 
to  the  transient  and  to  the  steady-state  regime  [9].  Since  the  noise  fluctuations  are  not 
very  large  for  the  time-integrated  charge,  it  is  possible  to  precisely  fit  in  time  the  transient 
contribution  with  a  polynomial  or  a  combination  of  exponentials.  This  procedure  yields 
very  smooth  curves  for  the  transient  currents,  which  can  now  be  Fourier  transformed  to 
yield  the  intrinsic  small-signal  parameters. 

m.  Optimization 

The  core  of  the  Monte  Carlo  algorithm  determines  the  times  of  flight  t  by  solving  the 
integral  equation  —Inr  =  /g  A(t)dt',  where  r  is  a  uniform  random  number  between  0  and 
1,  and  A(t)  is  the  total  scattering  rate  which  changes  in  time  as  the  particle  momentum 
and  energy  vary  under  the  influence  of  the  fields.  The  choice  of  an  appropriate  solution 
is  quite  important  for  an  efficient  algorithm.  An  important  concept  is  the  self-scattering, 
a  fictitious  event  which  does  not  affect  the  electron  state  when  selected.  A  self-scattering 
rate  can  be  adjusted  as  convenient  to  facilitate  the  evaluation  of  the  flight  time,  since  it 
does  not  affect  the  statical  properties  of  the  process.  The  simplest  approach  is  to  add  a 
self-scattering  rate  which  makes  the  total  rate  constant,  so  that  the  integral  above  can  be 
trivially  determined. 

A  comparison  of  various  techniques  can  be  found  in  [10].  Self-scattering  rates  can  be 
fixed  as  a  function  of  energy,  or  can  be  dynamically  adjusted  in  time  along  the  particle 
trajectory.  The  goal  of  optimization  of  this  process  is  to  reduce  as  much  as  possible  the 
amount  of  self-scattering  while  maintaining  but  still  retaining  it  to  guarantee  that  the  solu¬ 
tion  of  the  integral  is  always  statistically  correct,  rather  than  introducing  approximations. 
The  constant  time  technique  [10,11]  offers  a  good  trade-off  for  self-consistent  applications. 
The  simulation  time  is  divid^  into  small  constant  steps  for  all  the  particles,  and  a  self¬ 
scattering  rate  is  adjusted  to  make  the  total  rate  constant  in  time  within  that  interval.  The 
integral  is  solved  by  adding  up  trivial  contributions,  until  the  equality  is  satisfied. 

As  mentioned  earlier,  the  determination  of  the  energy  at  the  end  of  a  timestep  is  a  time 
consuming  operation  in  fall  bandstructure  calculations,  which  can  create  major  bottlenecks. 
Optimization  can  be  achieved  by  using  a  tetrahedral  mesh  in  momentum  space,  instead  of 
a  regular  grid,  arranged  so  that  nodes  of  a  tetrahedron  are  positioned  on  adjacent  energy 
isosurfaces  [6].  \l^th  these  grids,  a  linear  expression  can  be  used  to  determine  directly  the 
energy  for  given  momentum  coordinates,  with  precision  controllable  by  adapting  the  grid 
locally.  While  the  construction  of  this  algorithm  involves  a  considerable  initial  develop¬ 
ment  cost,  this  technique  promises  to  be  a  breakthrough  which  should  make  the  full  band 
approaches  not  more  expensive  than  analytical  band  algorithms. 


rV.  Supercomputmtion 
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The  advent  of  tupercomputen  has  offered  new  opportunities  to  improve  the  performance  of 
Monte  Carlo  codes  and  increase  the  sise  of  the  problems  to  be  solved.  The  particle  transport 
has  an  inherent  parallel  behavior  which  can  be  explmted,  but  the  main  obstacle  is  the  fact 
that  the  histories  of  different  particles  can  be  very  different  due  to  the  randomness  of  the 
processes,  and  such  a  difference  can  be  quite  emphasixed  in  the  case  of  sharply  nonuniform 
structures. 

Vectorization  techniques  can  be  very  effective  for  bulk  analysis,  since  it  is  sufficient  to 
follow  particles  from  one  scattering  event  to  another  in  parallel.  The  particle  histories  are 
equalized  by  sacrificing  s3rnchronism,  which  is  not  necessary  for  averaging.  Synchronism 
must  be  maintained  for  self-consistent  algorithms,  because  forces  must  be  evaluated  at 
specified  intervals.  Applications  using  an  ensemble  constant  time  approach  for  the  flight 
time  evaluation,  yield  a  vectorization  speed-up  between  3-5  on  a  CRAY  Y-MP  supercom¬ 
puter  [11],  where  the  maximum  possible  speed-up  is  about  10.  Reports  by  several  groups 
indicate  similar  speed-up  for  different  Monte  Carlo  implementations.  Comparisons  in  this 
area  are  extremely  difficult,  since  when  the  efficiency  of  a  code  is  improved,  the  achievable 
vectorization  speed-up  tends  to  decrease. 

Parallelization  can  also  be  very  advantageous  for  bulk  calculations,  since  the  particles 
are  substantially  independent.  For  more  complicated  self-consistent  models,  performance 
depends  on  the  actual  architecture  of  the  hardware  and  on  the  strategy  used  to  balance 
the  load  on  the  processors.  Work  in  this  area  is  still  largely  experimental,  reflecting  the 
immaturity  of  parallel  computers  and  compilers.  Parallelization  is  particularly  appealing  for 
3-D  simulation,  because  realistic  applications  requite  a  very  large  number  of  particles  [12]. 
Balancing  of  the  load  between  processors  is  very  important,  because  for  massively  parallel 
applications  even  a  small  percentage  of  non-parallelizable  code  may  make  the  computation 
inefficient. 

The  future  evolution  of  parallel  architectures  will  have  an  important  influence  on  Monte 
Carlo  applications.  This  is  particularly  true  for  the  most  advanced  applications  which 
require  the  storage  of  large  tables.  The  most  memory  intensive  model  is  at  the  top  of 
the  hierarchy,  where  tables  for  the  full  bandstructure  and  momentum-dependent  scattering 
rates  must  be  stored  and  need  to  be  equally  accessible  by  all  the  processors.  The  information 
is  only  read  by  the  processors  during  the  simulation,  since  these  tables  are  not  changed. 
Therefore,  for  efficiency,  such  tables  should  reside  in  a  shared  memory  region  which  can 
be  quickly  accessed  by  all  nodes  with  uniform  times,  rather  than  being  distributed  in  local 
memory  areas  appended  to  the  processors  or  being  completely  copied  in  each  of  these  areas, 
to  limit  storage  requirements.  The  remaining  compelling  memory  requirements  are  related 
to  particle  attributes,  like  position  and  momentum,  which  are  continuously  updated,  and 
position  dependent  data  (charge,  fields)  for  large  grids,  particularly  in  3-D.  The  main 
issue  is  to  efficiently  handle  the  communication  between  blocks  of  distributed  memory.  A 
logical  storage  scheme,  in  the  case  of  distributed  memory,  is  to  map  particles  and  grid 
nodes  of  domain  subregions  onto  separate  processors,  trying  to  balance  the  number  of 
particles  per  processor,  adaptively  throughout  the  simulation.  As  particles  cross  boundaries 
between  subregions,  they  should  be  reassigned  to  new  processors.  For  the  determination 
of  charge  on  grid  nodes  and  the  subsequent  solution  of  Poisson  equation,  a  small  amount 
of  communication  is  necessary  between  processors  corresponding  to  physically  contiguous 
regions.  Much  more  challenging  is  the  implementation  of  molecular  djmamics,  where  in 
principle  all  processors  need  to  communicate  with  each  other.  The  optimal  solution  for  a 
Monte  Carlo  code  would  be  to  have  only  shared  mmnory  available.  In  such  a  way  it  would 
be  possible  to  load  particles  on  fixed  processors,  regardless  of  position.  Provided  that  each 
single  processor  can  address  a  large  shared  memory  necessary  for  the  applications,  still 
great  challenges  remain  in  designing  system  software  and  compilers  for  such  a  system. 
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V.  Force  Evaluation 


The  numerical  solution  of  Poisson’s  equation  only  provides  forces  in  a  quasiststic  approx¬ 
imation.  In  some  applications  this  is  not  sufficient.  It  has  been  shown  that  to  simulate 
fast  phenomena  associated  with  the  transport  of  carriers  generated  by  femtosecond  laser 
pulses,  the  full  Lorents  force  must  be  evaluated  [13],  where  the  electric  and  magnetic  fields 
should  be  obtained  by  solving  the  time  dependent  Maxwell’s  curl  equations  (note  that  we 
refer  here  the  fields  are  generated  by  the  fast  moving  charge  particles,  not  to  the  laser 
radiation  which  is  absorbed  by  the  sample)  Implementation  of  the  algorithm  can  be  very 
suitable  to  parallel  computation  and  actually  cheaper  than  solving  the  Poisson’s  equation 
for  multi-dimensional  simulations  [12]. 

It  is  not  clear  at  which  frequency  range  the  inclusion  of  the  magnetic  field  begins  to  be 
necessary.  In  the  simulation  of  general  microwave  devices,  it  should  be  possible  to  simply 
substitute  Poisson’s  equation  with  the  time-dependent  wave  equation  for  the  retarded  po¬ 
tential,  which  has  the  same  space-dependent  terms,  to  accotmt  for  the  displacement  currents 
(in  1-D  the  displacement  current  contribution  can  be  integrated  and  applied  as  additional 
boundary  conffition  to  Poisson’s  equation).  However,  in  the  THs  regime  the  wavelength  of 
the  electromagnetic  field  in  the  doped  semiconductor  layers  csm  be  comparable  or  smaller 
than  the  dimensions  of  the  active  regions, 

In  other  cases,  the  solution  of  Poisson’s  equation  on  ducrete  points  may  not  be  accurate 
enough  to  resolve  the  coulomb  interaction  ^ween  charges.  Reduction  of  the  mesh  size 
may  not  improve  much  the  situation,  because  the  numba  of  simulated  carriers  is  fixed.  A 
molecular  dynamics  can  be  used  to  calculate  the  force  acting  on  a  particle  by  adding  the 
coulomb  potential  due  to  all  the  other  charges.  This  approach  would  automatically  include 
the  electron-electron  interaction  effects,  which  can  be  very  important  in  the  case  of  high 
concentrations.  The  major  computational  obstacle  is  in  the  l^t  that  the  full  molecular 
dynamics  evaluation  of  the  forces  involves  a  number  of  operations  of  order  where  N 
is  the  number  of  particles.  In  order  to  develop  practical  ^orithms,  it  should  be  possible 
to  apply  multipole  techniques,  which  have  been  extensively  used  to  calculate  molecule 
configurations,  ionic  systems  [7]  and  capacitances  in  complicated  VLSI  interconnect  layouts 
[8],  to  name  a  few  applications.  With  accurate  calibration,  multipole  algorithms  only 
require  a  number  of  operations  of  order  N.  The  idea  is  to  consider  particle-particle  forces 
only  within  an  appropriate  neighborhood,  and  treat  interactions  from  longer  range  particles 
through  interpolated  forces  on  a  mesh  with  increasing  coarseness  at  farther  distances. 

In  many  applications  it  is  common  to  simulate  only  particles  in  a  relatively  small  cell  of 
a  periodic  structure,  or  a  sample  of  a  larger  device.  When  the  Molecular  Dynamics  method 
is  implemented,  the  interactions  with  <^aiges  in  other  regions,  which  are  not  simulated, 
cannot  be  neglected.  H  one  assumes  that  the  simulated  geometric  sample  is  one  element 
of  a  periodic  structure,  every  particle  in  the  simulated  cell  corresponds  to  a  ’’replica”  in 
each  of  the  other  cells  of  the  periodic  domain.  The  replicas  of  a  given  particle  constitute 
then  a  ’lattice”  of  charge,  and  special  techniques  must  be  applied  to  get  a  convergent  sum 
of  the  interactions  keeping  down  the  computer  time  requirements  [14].  For  simulation  of  a 
bulk  material,  one  has  to  consider  3-D  lattices  of  replicas,  2-D  and  1-D  lattices  for  1-D  and 
2-D  device  simulations,  respectivdy,  and  in  the  case  of  a  complete  3-D  device  simulation 
no  replica  has  to  be  taken  into  account.  Although  more  accurate  than  Poisson’s  equation, 
a  mdecular  dynamics  algorithm  is  still  providing  forces  in  an  electrostatic  approximation. 


VI.  Hybrid  iedmiques 
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Monte  Carlo  techniques  have  found  many  useful  applications  beyond  full  self-consistent 
simulation  of  devices.  Assuming  that  the  correct  solution  of  the  Boltzmann  equation  is 
obtained,  Monte  Carlo  simulations  are  often  used  to  parameterize  other  mod^.  Field 
dependent  mobility  and  diffusion  coefficient  are  used  in  drift-diffusion  applications,  for 
instance,  and  a  number  of  other  parameters  are  ertracted  to  calibrate  hydrodynamic  and 
energy  transport  models.  Although  the  Monte  Carlo  results  should  not  always  be  trusted 
as  completely  exact,  the  potential  problems  in  these  schemes  are  more  due  to  the  fact  that 
parameters,  obtained  for  a  bulk  with  uniform  field,  are  often  employed  for  nonuniform  field 
condition,  which  can  be  considerably  different. 

The  Monte  Carlo  approach  is  also  used  as  a  postprocessor  using  the  potential  profile 
obtained  by  a  drift-diffusion  or  hydrodynamic  approach.  This  is  certainly  a  val'^l  approach 
for  large  devices,  where  not  only  a  drift-diffusion  solution  is  acceptable,  but  also  full  self- 
consistent  Monte  Carlo  solution  would  be  impractical.  By  tracking  particles  simulated  with 
Monte  Carlo  in  the  fixed  potential,  it  is  possible  to  evaluate  high  energy  effects  (injection 
into  oxide,  impact  ionization)  which  are  not  well  account  for  in  simpler  models.  The 
advantage  of  a  postprocessor  is  in  the  fact  that  one  can  use  very  efficient  vector  and  parallel 
algorithms,  since  the  tracked  particles  are  essentially  independent  in  nonself-consistent 
simulations. 

A  recent  application  of  Monte  Carlo  simulation  involves  the  calculation  of  tables  for 
a  scattering  matrix  technique  [15].  The  Monte  Carlo  procedures  provides  the  possible 
outgmng  momentum  values,  with  associated  probabilities,  for  a  given  incoming  momentum 
into  a  thin  slab  of  the  device.  By  partitioning  the  device  into  slabs  and  matching  the 
solutions  at  the  interfaces  with  the  momentum  scattering  tables,  it  is  possible  to  get  a 
solution  which  provides  all  the  information  of  a  sdf-con^tent  Monte  Carlo,  but  with 
smooth  solutions  without  the  noise.  Because  of  this,  rare  events  should  be  easier  to  observe 
directly.  Multi-dimensional  applications  are  also  possible  with  this  technique. 

Vn.  Improved  Physical  Models 

A  shortcoming  of  Monte  Carlo  models  is  due  to  the  unavailability  of  many  constants  which 
are  necessary  to  determine  the  scattering  rates.  A  typical  example  is  represented  by  de¬ 
formation  potentials  of  phonon  scatterings.  The  usual  procedure  is  to  choose  a  set  of 
deformation  potentials  which  fit  experimental  data  for  steady-state  velocity-field  charac¬ 
teristic  curves.  Unfortunately,  many  slightly  different  sets  can  be  found  which  provide  a 
reasonable  fit.  Measurements  of  some  important  parameters,  like  the  intervalley  deforma¬ 
tion  potential  for  transitions  between  F  and  £  valleys  in  GaAs,  have  been  attempted,  but 
the  data  reported  by  various  groups  are  too  contradictory  to  resolve  the  uncertainty. 

The  solution  is  to  formulate  new  models  which  are  based  on  first  principles  and  rely 
less  on  parameter  fitting.  Electron-phonon  scattering  is  usually  treated  with  a  simplistic 
dispersion  relation,  and  is  defined  on  arbitrary  partitions  of  the  Brillouin  zone  centered 
around  energy  minima  (valleys).  At  high  fields,  the  validity  of  this  picture  is  questionable. 
A  complete  phonon  model  with  accurate  dispersion  relations  is  needed,  but  the  computa¬ 
tional  complexity  is  formidable.  A  consistent  approach  should  treat  both  the  bandstructure 
and  the  electron-phonon  interaction  on  the  same  footage. 

Electron-phonon  matrix  elements  of  the  true  Hamiltonian  are  equivalent  to  the  matrix 
elements  of  a  pseudo-Hamiltonian,  as  long  as  certain  conditions  on  the  true  potential  are 
met.  Several  calculations  have  been  performed  for  semiconductors  using  local  and  nonlocal 
empirical  psendcqiotentials.  These  are  assumed  to  be  the  sum  of  spherical  potentials  which 


move  rigidlj  with  the  etonu  [16,17].  Another  approach  uses  the  initio  pseudopotentials, 
which  avoid  self'conaistency  problems  of  the  empirical  approach  by  approximating  the 
change  in  charge  density  when  the  atoms  are  displaced  (18).  Computations  using  the 
Harris  functional  approach  have  provided  the  deformation  potentials  for  phonon  scattering 
in  Si,  throughout  the  Brillouin  sone.  A  significant  result  is  the  evidence  of  the  variation 
of  deformation  potentials  with  initial  and  final  state  wavevector.  The  total  deformation 
potential  exhibits  a  high  degree  of  dispersion,  especially  for  transitions  away  from  the  valley 
minima  [18].  These  results  have  significant  impUcations  for  improving  the  predictive  power 
of  high-field  Monte  Carlo  simulations. 


Vm.  Conclusions 

Monte  Carlo  techniques  for  device  simulation  are  undergoing  dramatic  developments  due  to 
the  recent  evolution  of  available  computational  platforms.  Full  bandstructure  applications 
are  already  practical  for  use  on  top  of  the  line  workstations.  New  optimization  techniques 
for  the  determination  of  momentum  space  trajectories,  new  approaches  for  electron-phonon 
interaction  which  remove  much  of  the  uncertainties  of  current  models,  and  emerging  ap¬ 
plications  on  parallel  architectures,  should  contribute  to  provide,  in  the  next  few  years, 
accurate  and  efficient  simulators  with  sufficient  predictive  capability  in  the  high  field  trans¬ 
port  regime  to  meet  the  needs  of  CAD  designers  of  new  generations  of  integrated  devices. 
Monte  Carlo  techniques  have  already  found  a  very  important  role  as  tools  for  the  calibra¬ 
tion  of  simpler  models,  and  as  postprocessing  complements  of  conventional  simulators,  to 
quickly  assess  the  importance  of  hot  electron  phenomena  affecting  device  reliability. 
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Abstract 

The  role  of  electron-hole  scattering  on  the  low  field  mobility  of  electrons  in  p-doped  sili¬ 
con  is  examined  using  an  Ensemble  Monte  Carlo  approach.  The  transport  model  includes 
both  electron  and  hole  dynamics  with  coupling  between  the  electron  and  hole  systems 
through  coulombic  electron-hole  interactions.  The  electron-hole  scattering  rate  is  calcu¬ 
lated  using  the  elliptical  nature  of  the  conduction  band  valleys  and  non-parabolidty  of 
the  valence  band.  Minority  electron  low-field  drift  mobilities  are  calculated  at  acceptor 
concentrations  of  4.5xl0^*cm~^  and  3.8xl0^*cm~^  at  300K.  Calculations  are  in  excellent 
agreement  with  experimental  values. 


I.  Introduction 

ESxperimental  investigations  of  minority  electron  transport  in  silicon  report  a  consider¬ 
able  reduction  in  electron  mobility  in  the  presence  of  an  ^>plied  electric  field(l,2].  This 
decrease  is  attributed  to  the  drag  effect  of  electron-hole  scattering  which  is  a  result  of 
the  net  momentum  transfer  from  electrons  to  holes.  In  this  work,  Ensemble  Monte  Carlo 
techniques  are  used  to  examine  the  low-field  mobility  of  minority  electrons  in  silicon. 

Previous  microscopic  studies  of  minority  electron  transport  using  Monte  Carlo  techniques 
have  been  conducted  for  GaAs[3,4]  and  silicon[5,6].  FWuto  and  coworkers[3]  compared 
experimental  drift  velocity  in  p-doped  GaAs  with  Monte  Carlo  calculations.  The  hole 
transport  model  used  in  their  work  contained  only  heavy  holes  and  simulated  hole  dy¬ 
namics  with  a  drifted  Maxwellian  distribution.  Their  hole  model  ignores  the  complex 
nature  of  the  valance  band.  Sedra  and  coworkers[4]  used  a  more  accurate  hole  trans¬ 
port  model  and  performed  simulations  using  a  single  particle  Monte  Carlo  technique.  A 
shortcoming  of  both  their  works  and  our  previous  work  in  silicon[5,6]  is  the  use  of  the 
Brooks-Herring  impurity  scattering  model.  This  model  is  based  on  the  first  Bom  approx¬ 
imation  and  is  invalid  for  impurity  concentrations  above  10*^cm~’  at  room  temperature. 
This  was  clearly  shown  in  our  previous  calculations  at  3.8xl0^*cm~’[6]  where  calculated 
low  field  mobility  overestimated  experimental  values[l].  A  new  impurity  scattering  model 
for  Monte  Carlo  calculations  was  developed,  to  correct  this  problem[7]. 

In  this  contribution  we  report  low  field  drift  mobility  of  electrons  in  room  temperature 
silicon  for  acceptor  concentrations  of  of  4.5xl0**cm~’  and  3.8xl0**cm~’  at  300K.  Cal¬ 
culations  are  reported  using  an  improved  hole  transport  model[8]  and  impurity  scattering 
model[7]. 
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II.  IVaiisport  Model 

The  transport  model  includes  the  dynamics  of  both  electrons  and  holes.  Modeling  of  both 
carriers  is  required,  due  to  the  strong  effect  of  the  hole  plasma  on  electron  transport.  The 
hole  transport  model  includes  warped  non-parabolic  heavy  and  light  hole  bands  and  a 
spherical  spin-orbit  band.  The  non-parabolic  nature  of  the  heavy  and  light  hole  bands  is 
represented  by  an  energy  dependent  effective  mass.  The  scattering  mechanisms  taken  into 
accoimt  are  intra  and  inter-band  elastic  acoustic  and  non-polar  optical  phonon  scattering. 
More  detail*  on  the  hole  transport  model  along  with  comparisons  to  experimental  values 
are  repoHed  elsewhere[8]. 

The  electron  transport  model  contains  all  six  non-parabolic  ellipsoidal  n-valleys.  Three  in¬ 
tervalley  p-type  phonon  scattering  between  parallel  valleys,  three  intervalley  f-type  phonon 
scattering  between  perpendicular  valleys,  elastic  intervalley  acoustic  phonon  scattering, 
ionized  impurity  scattering,  and  electron-hole  scattering.  The  phonon  scattering  and  ma¬ 
terial  parameters  given  by  Brunetti  and  coworkers[9]  and  the  intra  and  inter- valley  phonon 
scattering  rates  calculated  by  Jacoboni  and  Reggiani[10]  are  used  in  this  work.  Since  cal¬ 
culations  are  reported  at  low  fields  and  doping  concentrations  greater  than  10*^cm~^, 
impurity  scattering  models  based  on  the  first  Bom  approximations,  such  as  the  Brooks- 
Herring  model,  overestimate  experimental  mobility.  For  this  reason  a  new  impurity  scat¬ 
tering  model  was  developed  and  implemented  in  our  Monte  Carlo  program  [7].  The  model 
and  method  used  to  implement  electron-hole  scattering  is  discussed  in  the  next  section. 


III.  Electron-Hole  Scattering  Rate 

The  interaction  between  an  electron  and  hole  is  assumed  to  be  a  screened  coulomb  po¬ 
tential.  The  probability  of  an  electron  and  hole  making  a  transition  from  the  initial  state 
(k«,  kh)  to  a  final  state  (k^,  k^)  as  a  resxilt  of  a  Coulombic  interaction  is  given  by  th'' 
Fermi  Golden  rule  for  first-order  transistions.  By  summing  over  all  final  electron  and  hole 
states  and  determining  whether  or  not  the  final  states  are  occupied  after  the  scattering 
mechanism  is  chosen[10],  the  scattering  rate  becomes 

kh)  = 

where  m«  is  the  free  electron  mass,  p  is  the  hole  concentration,  m  is  defined  as 


where 


Mi  mh  mi 

and  (t  specifies  the  direction  x,y,z) 


where 
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Figure  1:  Minority  Electron  mobility  in  room  temperature  silicon  as  a  function  of  applied 
electric  field  for  an  acceptor  concentration  of  4.5xl0**cm~^. 

The  relative  wavevector  g  is  made  more  complex  by  the  direction  dependent  electron 
mass  and  energy  dependent  hole  mass.  In  its  present  form,  equation  (1)  is  not  suitable 
for  calculating  scattering  rates  in  an  Ensemble  Monte  Carlo  program,  since  the  hole 
distribution  function  is  not  known  before  the  scattering  event  is  chosen.  To  overcome 
this  problem,  the  method  proposed  by  Brunetti  and  coworkers[9]  is  used  where  the  term 
replaced  by  its  maximum  value  of  1/2P*.  By  using  the  maximum  value, 
the  resulting  scattering  rate  is  the  scattering  rate  for  electron-hole  scattering  plus  a  self¬ 
scattering  rate  and  a  rejection  method  is  used. 

rV.  Resulti 

Figures  (1)  and  (2)  show  minority  electron  mobility  calculations  and  experimental  data 
as  a  function  of  applied  electric  field  for  acceptor  concentrations  of  4.5xl0^’cm~^  and 
3.8xl0^*cm~^,  respectively.  Majority  electron  mobility  is  shown  for  comparison.  As 
these  figures  show,  there  is  good  agreement  between  experimental  results  report  by  Tang 
and  coworkers[4]  and  Monte  Carlo  calculations.  The  experimental  results  of  Tang  and 
coworkers[4]  show  a  45%  decrease  from  the  sero  field  mobility  at  only  lOOV/cm  at  both 
acceptor  concentrations  while  Monte  Carlo  calculations  show  a  40%  decrease  under  the 
same  conditions.  This  dramatic  decrease  in  mobility  when  an  electric  field  is  applied 
is  due  to  the  drag  effect  of  electron-hole  scattering.  After  the  initial  decrease,  mobility 
essentially  stays  constant  as  the  electric  field  is  increased.  This  saturation  of  the  drag 
effect  is  due  to  the  inherent  Coulombic  nature  of  electron-hole  scattering.  As  the  electric 
field  is  increased,  the  velocity  of  both  carriers  also  increase  resulting  in  a  larger  exchange 
of  momentum  during  electron-hole  scattering.  But,  as  with  any  Coulombic  interaction, 
as  the  velocity  increases  the  scattering  cross  section  decreases.  The  larger  exchange  of 
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Figure  2:  Minority  Electron  mobility  in  room  temperature  silicon  as  a  function  of  applied 
electric  field  for  an  acceptor  concentration  of  3.8xl0^*cm~’. 


momentum  is  countered  by  a  decrease  in  the  frequency  of  the  interactions  resulting  in  a 
saturation  of  the  drag  effect. 


V.  Conclusion 

Minority  electron  transport  in  room  temperature  silicon  has  been  examined  using  an  En¬ 
semble  Monte  Carlo  approach  that  includes  the  dynamics  of  both  electrons  and  holes.  A  i 

theoretical  expression  for  electron-hole  scattering  which  takes  into  account  the  ellipsoidal 
nature  of  the  conduction  band  valleys  and  non-parabolidty  of  the  valence  band  was  devel¬ 
oped  and  implemented.  Improved  hole  transport  and  impurity  scattering  models  were  also 
implemented.  Using  the  model  discussed  in  sections  (II)  and  (III),  calculations  are  in  ex¬ 
cellent  agreement  with  experimental  results.  Both  calculated  and  experimental  data  show  J 

a  dramatic  decrease  from  the  sero  field  mobility  when  the  electric  field  is  only  lOOV/cm. 

This  reduction  is  attributed  to  the  drag  effect  of  electron-hole  scattering. 
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Abstract 

Hole  transport  in  the  warped  band  model  of  GaAs  has  been  investigated  using  em  En¬ 
semble  Monte  Carlo  approach.  Scattering  rates  used  in  the  simulations  were  derived 
for  warped  heavy  and  light  hole  bands  and  for  a  spherical  split-off  band.  The  obtained 
velocity-field  characteristics  are  in  better  agreement  with  experimental  data  than  re¬ 
cently  published  values  where  only  the  heavy  and  light  hole  bands  were  included  in  the 
model  and  warping  was  accounted  for  by  an  approximate  overlap  function.  Simulations 
show  that  a  better  fit  to  the  experiment  and  to  results  of  a  realistic  band  structure 
model  can  be  attained  with  an  optimal  choice  of  valence  band  parameters. 

I.  Introduction 

The  performance  of  semiconductor  devices,  such  as  bipolar  transistors,  p-channel  field- 
effect  transistors  and  hetero junction  bipolar  transistors,  is  determined  by  hole  trans¬ 
port  proprieties.  In  some  earlier  studies,  warping  of  the  valence  band  has  been  either 
ignored[l]  or  taken  into  account  by  the  use  of  an  approximate  overlap  function[2].  The 
split-off  band  has  also  been  mostly  ignored  due  to  its  relatively  low  population.  How¬ 
ever,  warping  and  the  effect  of  the  spin-orbit  interaction  on  the  heavy  and  light  hole 
band  cannot  be  overlooked  if  precise  hole  transport  modeling  is  an  issue.  Breiman  and 
coworkers[3,  4]  used  the  Monte  Carlo  method  in  their  investigation  of  hole  transport  in 
a  realistic  band  structure.  However,  high  precision  is  achieved  that  way  at  the  expense 
of  CPU  time.  Recently  BrudevoU  and  coworkers[5]  pointed  to  the  fact  that  scattering 
rates  often  used  in  literature  for  ionized  impurity  scattering,  inter-band  polar  optical 
phonon  scattering  and  acoustic  phonon  scattering  rates  require  corrections.  BrudevoU 
and  coworkers  [2]  studied  the  warm  and  hot  hole  drift  velocity  in  GaAs  using  a  valence 
band  model  made  of  a  heavy  and  a  light  hole  band,  and  warping  of  the  valence  bands 
was  taken  into  accoimt  by  the  use  of  an  approximate  overlap  function. 

In  this  paper,  hole  transport  in  a  warped  valence  band  model  of  GaAs  is  investigated 
using  an  Ensemble  Monte  Carlo  program.  The  scattering  rate  calculations  for  intra-  and 
inter-band  hole  scattering  take  warping  and  the  overlap  integrids  into  account.  Hole 
velocities  were  obtained  for  different  sets  of  warping  parameters  and  for  fields  up  to 
lOOkV/cm.  Simulation  results  using  this  simple  model  are  compeired  to  the  results  of 
BrudevoU  and  coworkers[5],  Brennan  and  coworkers[3],  and  to  the  experimental  data 
of  Holway  and  coworkers[6].  The  Monte  Carlo  model  used  in  this  work  is  presented 
next.  The  results  of  the  simulation  are  discussed  in  section  III.  Conclusions  are  given 
in  section  IV. 


II.  The  Monte  Carlo  Model 


The  Monte  Carlo  approach  is  used  in  this  work  to  investigate  hole  transport  for  fields 
applied  along  the  (100)  direction.  The  valence  band  model  includes  warped  heavy  and 
light  hole  bands  and  a  parabolic  split-off  band.  For  the  warped  bands,  the  energy  is 
given  by: 

=  (1) 

where 

+  (C/j4)*(sin*  6  cos*  0  +  sin^  0  cos*  ^  sin*  tf)|  (2) 

Above,  the  -t-(-)  sign  correspond  to  the  light(heavy)  hole  band,  0{^)  is  the  polar(azimuthal) 
angle  of  k,  and  A,  B,  and  C  are  the  inverse  band  mass  parameters.  The  set  of  param¬ 
eters  (A=-6.98,  B=-4.5,  C=6.2)  given  by  Madelung[7],  and  the  sets  (A=-7.98,  B=- 
5.16,  C=6.56)  and  (A=-7.65,  B=-4.82,  C=7.7)  used  by  Brudevoll  and  coworkers[5]  are 
adopted  in  simulations.  The  mass  of  the  holes  in  the  warped  bands  is  direction  depen¬ 
dent  and  is  given  by: 

For  the  split-off  band,  a  spherical  equi-potential  surface  is  assumed  with  a  scalar  ef¬ 
fective  mass.  Scattering  mechanisms  included  are  the  acoustic  phonon,  optical  phonon 
scattering,  screened  polar  optical  phonon,  with  intra-  and  inter-band  transfers  for  all 
three  bands.  Impurity  scattering  is  not  included  for  simplicity. 

The  scattering  rates  contain  overlap  factors  which  are  approximated  by  functions  de¬ 
rived  by  Wiley [8].  The  resulting  expressions  become  easy  to  use  in  Monte  Carlo  calcu¬ 
lations.  In  the  particular  case  of  ionized  impurity  scattering,  Jacoboni  and  Reggiani[9] 
propose  to  neglect  a  weak  warping  in  the  pnx^ss  of  scattering  process  determination, 
but  use  a  corrected  effective  mass.  Warping  woidd  be  taken  into  account  in  final  state 
determination.  Such  approach  is  justified  as  warping  has  a  noticeable  effect  on  next 
state  determination  rather  than  on  the  values  of  the  scattering  rates.  Tiersten[10],  for 
example,  found  that  the  valence  band  anisotropy  can  result  in  hole  mobility  variation 
of  up  to  20%,  while  the  scattering  rates  vary  by  less  than  l.%.  In  our  Monte  Carlo 
program,  scattering  rates  are  evaluated  at  each  time  step  so  that  angular  evolutions 
could  be  observed.  The  scattering  rates  for  the  acoustic  and  optical  non-polar  scatter¬ 
ing  are  weighted  by  an  overlap  factor  of  1/2  while  the  polar  optical  transition  rates  are 
integrated  taking  the  overlap  factor  into  account.  The  final  states  are  determined  by 
applying  the  rejection  technique  to  the  exact  angular  probability  functions  containing 
angular  contributions  firom  both  warping  and  the  overlap  integral.  The  results  of  the 
simulations  will  be  presented  in  the  following  section. 

ni.  Results  and  Discussion 

The  velocity-field  characteristics  were  calailated  for  the  current  model  at  300K  lattice 
temperature.  Figure  1  shows  the  obtained  velocities  for  the  same  parameters  used  by 
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Brudevoll  and  coworkers  [2].  Also  shown  are  the  experimental  results  of  Hoi  way  and 
coworkers[6].  The  recent  results  of  Brudevoll  and  coworkers  overestimate  the  hole  drift 
velocity  at  all  fields.  It  is  obvious  from  figure  1  that  the  current  calculations,  which 
take  into  accoimt  the  direction  dependence  of  the  hole  mass,  are  in  better  agreement 
with  the  experiment.  The  inclusion  of  the  split-off  band  doesn’t  have  a  strong  effect  at 
low  fields  as  its  hole  population  is  low.  However,  its  contribution  to  the  hole  velocity 
becomes  important  at  fields  higher  than  60kV/cm.  It  is  also  above  that  field  that  the 
current  model  and  the  model  of  Brudevoll  and  coworkers  don’t  realize  a  complete  veloc¬ 
ity  saturation.  Figure  2  illustrates  the  dependence  of  the  hole  drift  velocity  on  warping 
parameters.  The  shown  simulation  results  were  obtained  using  3  sets  of  valence  band 
parameters  and  GaAs  parameters  taken  from  reference  [4].  The  set  (A=-6.98,  B=-4.5, 
0=6.2)  by  Madelung[7]  and  (A=-7.65,  B=-4.82,  0=7.7)  result  in  almost  equal  velocity 
curves  and  band  warping.  These  results  overestimate  the  velocity  at  high  fields  when 
compared  to  experiment2d  data  and  the  results  of  Brennan  and  coworkers  [3]  obtained 
using  a  more  complete  valence  band  model.  However,  for  fields  below  60kV /cm  results 
closer  to  the  experiment  can  be  obtained  by  an  optimal  choice  of  valence  band  param¬ 
eters. 

rv.  Conclusion 

We  have  investigated  hole  transport  in  the  warped  band  model  of  GaAs,  using  an  Ensem¬ 
ble  Monte  Carlo  program.  The  velocity-field  characteristics  for  three  recently  reported 
valence  band  parameters  have  been  shown  to  be  between  the  recent  results  of  Brudevoll 
and  coworkers[5],  the  results  of  Brennan  and  coworkers[4]  and  the  experimental  data 
of  Holway  and  coworkers[6].  The  current  model,  though  simple,  coupled  to  am  optimal 
choice  of  valence  band  parameters,  results  in  velocity-field  characteristics  consistent 
with  previously  published  data.  This  model  could  be  useful  in  the  study  of  anisotropy 
dependent  phenomena,  such  as  orientational  relaxation  of  carrier  momentum. 
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Abstract 

The  effect  of  the  electron  transport  regime  in  the  base-collector  junction  on  the  steady- 
state  and  high-frequency  characteristics  of  a  heterojunction  bipolar  transistor  (HBT)  is 
examined.  A  new  technique  based  on  the  Fourier  analysis  of  the  induced  collector  current  to 
evaluate  HBT  high  frequency  performance  is  used.  The  non-stationary  electron  transport, 
velocity  overshoot  effects  and  effects  of  the  high-current  density  are  taken  into  account 
using  self-consistent  time-dependent  ensemble  Monte  Carlo  particle  simulation. 


I.  Introduction 

Submicrometer  hetero  junction  bipolar  transistors  (HBTs)  are  now  widely  investigated 
because  of  their  great  promise  for  microwave,  millimeterwave  and  ultra-high-speed  digital 
operations.  In  a  typical  N  —  p  —  n  HBT  with  abrupt  emitter-base  heterojunction  electrons 
are  injected  from  wide-gap  emitter  with  considerable  excess  energy  into  a  p-type  narrow- 
gap  base  region.  As  a  result  the  electron  transport  in  the  submicrometer  HBT  is  far  from 
equilibrium.  To  improve  the  HBT  high-frequency  performance  the  different  base-collector 
junction  designs  have  been  proposed  and  investigated  experimentally  in  [1,2]  and  using  nu¬ 
merical  simulation  in  [3-5].  For  these  designs  the  electrons  stay  longer  in  central  valley  and 
therefore  achieve  high  average  velocities  into  the  collector  region.  Thus,  extreme  nonequi¬ 
librium  electrons  transport  in  the  base  and  collector  regions  plays  a  role  in  determining 
device  performances.  Therefore  to  calculate  transistor  electrical  characteristics  with  high 
accuracy  the  numerical  simulation  seems  to  be  the  only  powerful  way. 


II.  Numerical  Model 

To  investigate  the  operation  and  design  principles  of  the  HBTs  one-dimensional  ensemble 
Monte  Carlo  simulator  was  developed  [5].  Our  Monte  Carlo  model  incorporates  complicated 
non-parabolic  electron  energy  spectra  and  all  essential  scattering  mechanisms.  Motion 
of  the  holes  in  the  base  region  and  the  electrons  in  the  heavily  doped  collector  contact 
region  is  evaluated  using  the  drift-diffusion  approiudi.  Two  distinctive  features  are  inherent 
in  the  model.  First  feature  is  concerned  with  the  choice  of  the  self-scattering  events. 
To  minimize  the  number  of  the  self-scattering  events  a  special  procedure  is  used  where 
the  total  scattering  rates  are  represented  by  the  tabulated  momentum-dependent  step-like 
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functions.  These  functions  are  chosen  taking  into  account  local  values  of  the  electric  field 
and  impurity  concentration.  Applications  of  this  technique  permits  to  reduce  the  number 
of  self-scattering  events  to  7%  of  the  scattering  events  total  number. 

Other  novice  of  the  model  is  concerned  with  the  calculation  of  the  HBT  small-signal 
high-firequency  performance.  The  HBT  high-frequency  performance  is  determined  by  the 
base  resistance,  the  emitter  and  collector  capacitances  effects  and  the  carrier  transit  effects 
in  the  base  amd  collector  regions.  To  characterize  the  HBT  as  am  microwave  and  millimeter- 
wave  amplifier  the  cut-off  frequency  fr  is  usually  used.  As  a  rule  the  following  expression 
derived  from  the  smadl-signad  equivalent  circuit  amalysis  is  used  to  estimate  fx  in  a  HBT 
[1]: 


fr  =  {2ir  •  [te  ■  Cbe  +  tbc  +  +  Re Rc)  ■  Cbc]}  ^  (1) 

where  te  is  the  emitter  resistance,  Re  is  the  emitter  series  resistance,  Rc  is  the  collector 
series  resistance,  Cbe  and  Cbc  are  the  emitter  and  collector  capacitance  respectively,  tbc 
is  the  base-to-coUector  transit  delay  time.  In  this  expression  all  the  carrier  transit  effects 
are  considered  by  the  only  carrier  transit  delay  time  through  the  base  and  coUector  tbc 
usually  being  estimated  from  the  steady-state  distribution  of  the  mean  electron  velocity 
[1,6].  It  is  obvious  that  this  tbc  formulation  is  not  absolutely  correct  because  it  does  not 
taJce  into  account  the  electron  velocity  profile  in  the  collector  region  and  detaul  shape  of 
the  electron  distribution  function.  The  last  reaison  is  especially  important  in  the  presence 
of  the  non-stationary  carrier  transport.  Therefore  method  based  on  the  Fourier  analysis  of 
the  non-stearly-state  induced  collector  current  is  used  in  present  model. 

Simple  high-frequency  equivalent  circuit  of  HBT  is  shown  in  Fig.l.  For  this  circuit 
frequency-dependent  common-emitter  short-circuit  current  gain  /i2i  “ay  be  written  ais  fol¬ 
lows: 


/l2l(w)  — 


’ _ (1  -I-  ju^Cflgrg)  •  (1  -f  jijiCbcRc) _ 

.7(0;)  —  jtjjCsci^E  +  Re)  +  ^^Cbe(^bc'>'eRe 


iW)  =  7£(w)  •  ^(w)  •  7c(a;) 


(2) 


where  7^  is  the  emitter  efficiency,  ^  is  the  ba«e  transmission  coefficient  and  7c  is  the 
collector  transmission  coeflicient.  The  values  of  the  resistances  and  capau:itances  in  (2) 
ase  calculated  from  the  numerical  simulation  of  the  required  steauly  state  or  aure  estimated 
from  the  experimental  results.  Further  in  our  analysis,  Cbc,Cbe,‘’'EiRe^Rc  and  7^  aure 
assumed  to  be  independent  on  frequency  and  to  be  dependent  on  the  applied  biais.  The 
only  problem  is  the  calculation  of  the  complex  frequency-dependent  coefficient  7(u;) . 

To  calculate  P{lj)  •  7c(w)  the  Fourier  anadysis  of  the  non-steaidy-state  induced  collec¬ 
tor  current  is  used  [3].  Initially  the  steady-state  simulation  for  the  given  applied  bias  is 
performed  and  stationary  distributions  of  the  potential  and  caurriers  concentrations  aure  cad- 
culated.  After  that  the  electron  bunch  consists  of  several  thousand  electrons  is  launched 
from  the  baue-emitter  interfaKre  during  short  time  interval  and  the  time  dependent  collector 
current  response  Jc(f)  is  calculated.  Here  we  ausume  that  the  injected  non-steauly-state 
electrons  travel  through  base  and  collector  in  the  known  stationairy  electric  field.  When 
adl  launched  electrons  leave  the  modeling  region  the  Fourier  transformation  of  the  injected 
non-stationary  emitter  current  Js(f)  and  induced  collector  current  atre  cairried  out.  The 
frequency-dependent  base-collector  transmission  coefficient  is  defined  as  follows: 
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Fig.l  Small-signal  equivalent  circuit  (a)  and  cut-oflf  firequency  /r  as  a  function  of  the 
collector  current  Jc  (b). 


P{u>)  •  7c(w) 


F[Mt)] 

F[Mt)y 


(3) 


where  F  denotes  the  Fourier  transform.  Cut-off  frequency  /x  is  defined  firom  the  analysis 
of  the  kii{u)  computed  by  (3)  as  a  frequency  where  |h3i(b;)|  =1.  It  should  be  noted  that 
proposed  method  is  also  appropriate  to  calculate  other  HBT  high-frequency  characteristics 
such  as  power  gain,  stability  factor,  scattering  matrix  and  so  on. 

To  verify  proposed  Monte  Carlo  particle  model  and  method  of  evaluation  of  the  HBT 
high-frequency  performance  the  snbmicrometr  HBT  with  the  same  structure  as  in  [1]  was 
simulated.  The  comparison  of  the  simulated  results  with  the  experimentaUy  observed 
results  have  shown  high  accuracy  and  validity  of  the  model  (see  Fig.l). 


in.  Results  and  Discussions 

A  series  of  AUnAs/GalnAs  HBTs  with  arrange  of  base  thickness  Wb  from  0.02/im  to 
0.4/im  and  doped  at  1.5  •  10**cm~^  was  simulated  at  the  temperature  300K  to  investigate 
the  base  thickness  effect  on  the  electron  transport  regime  in  the  base. 


1.06*01  H - 1 - —I - — - 1 - , - , 

0  0.1  0.2  0.3  0.4  0.5 

Wg  ,  um 

(b) 


Fig.2  CommoQ  emitter  current  gain  ^  (a)  and  base  cut-off  frequency  fs  (b)  as  a  function 
of  the  base  thickness  Wb> 


The  HBT  common  emitter  current  gain  P  dependence  as  a  function  of  the  base  re- 
^on  thickness  is  shown  in  Fig.2.  The  analysis  of  this  figure  shows  that  0(Wb)  is  pro¬ 
portional  l/l^B(conesponds  to  ballistic  transport)  for  Wb  <  0.1/im  and  is  proportional 
I/W3  (corresponds  to  diffusive  transport)  for  Wb  >  0.1/im. 

The  base  cut-off  frequency  /b  =  ,  where  tb  is  a  base  delay  time,  dependence 

as  a  function  of  the  base  layer  thickness  is  presented  in  Fig.2.  Solid  curve  in  the  figure 
presents  the  dependence  calculated  by  the  numerical  simulation  (method  based  on  the 
Fourier  decomposition  of  the  non-steady-state  base  current  was  used  in  present  work  to 
estimate  the  tb).  Dashed  curve  presents  the  dependence  calculated  for  diffusive  transport 
and  dotted  curve  corresponds  to  ballistic  transport.  It  is  seen  that  the  ballistic  transport 
take  place  only  for  Wb  <  O.Ofi^m  and  pure  diffusive  transport  occurs  while  the  base 
thickness  is  more  than  0.2/im.  For  0.06/im  <  Wb  <  0.2fim  the  electrons  transport  in  the 
base  region  is  far  from  equilibrium  but  is  neither  pure  ballistic  nor  pure  diffusive.  In  this 
situation  correct  estimation  of  the  base  delay  time  is  possible  only  by  numerical  simulation. 
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Abstract 

We  introduce  a  scaled  ensemble  Monte  Carlo  (SEMC)  technique,  useful  for  obtaining 
statistically  significant  results  far  into  the  low-density  tails  of  carrier  distributions.  Standard 
approaches  for  studying  this  regime  (weighted  EMC)  are  based  on  determining  single-particle 
distribution  functions  using  a  nonuniform  trajectory  sampling;  the  new  technique  instead 
stimulates  a  scaled  distribution,  with  an  energy-dependent  scaling  factor,  but  uses  ordinary 
EMC  weighting.  This  technique  is  flexible  ai^  simple  to  code.  We  display  some  results  for 
bulk  GaAs,  wi£  densities  ran^g  over  seven  orders  of  magnitude,  using  only  10,000  particles. 


I.  General  System  Description 

Our  treatment  is  in  the  classical  regime,  in  the  absmce  of  significant  interpaiticle  conelations,  so 
the  system  is  described  formally  by  the  usual  time-dependent  single-particle  distribution 
function  /(r,p;r) »  /(x:r)  [x  » (r.p)  is  a  phase  space  coordinate].  The  distribution  function 
obeys  the  liouville  equatkxu 

(1) 

where  H  »  H{x)  is  the  time-independent  single-particle  Hamiltonian,  and  {•,*)  is  the  Poisson 
bracket  We  make  die  usual  assumptions  that  collisions  or  scattering  events  tate  place  on  time 
scales  much  shorter  than  the  time  between  collisions,  so  effects  such  as  collision  broadening  can 
be  neglected.  Further,  the  scattering  events  are  in  fact  a|q;m>ximated  as  instantaneous,  so  that 
intracoUisioiuil  field  effects  can  also  be  ignored.  Under  these  assumptions,  all  important 
sources  of  potential  and  phonon  scattering  can  be  written  in  the  form 

»-r(x;r)  /(x;r)  +  J  r(x.x»;r)/(x';r)  dx'  .  (2) 


II.  Scaling  Formalism 

The  approach  developed  depends  primarily  on  the  ob^rvation  that  an  energy-  and  time- 
dependent  scaling  factor  commutes  with  noncollisional  term  on  die  right-hand  side: 

sm)  {H,f]  =  .  (3) 

Thus,  we  define  a  scaled  distribtition  function  defined  1^ 

^(x;f)«j(//(x),f)/(x;r). 


(4) 
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The  inincipal  advantage  of  multiplying  by  an  energy-dependent  factor  arises  from  counting 
statistics.  If  we  are  interested  in  tte  distribution  function  in  some  region  about  the  phase  space 
point  X,  we  consider  as  a  ftmcdon  of  time  the  number  of  simulation  particles  N  in  the  vicini^  of 
that  point  The  fractional  error  in  j  is  then  l/ViV  ,  which  is  the  fractional  error  in  /  s  as 
well.  For  regions  of  low  phase-space  density.  N  is  proportional  to  /(x;r).  Thus,  for  example, 
to  examine  regions  where  the  phase  space  densi^  is  down  by  seven  orders  of  magnitude  from 
the  maximum,  one  needs  many  times  of  ten  million  simulation  particles  to  keep  the  error  from 
exceeding  die  estimate. 

A  standard  solution  to  this  problem  —  weighted  EMC  —  is  to  define  a  region  of  interest  in 
phase  space,  and  to  perform  multiple  simulations  of  those  few  particle  trajectories  which  enter 
it.  Usually,  this  is  coded  in  a  way  that  sharply  distinguishes  high-  and  low-density  regions.  If 
the  density  falls  smoothly,  there  is  no  efficient  place  to  draw  the  boundary  between  these  two 
regions.  Other,  mote  subtle  approaches  have  also  been  used  [1]. 

The  present  approach  is  based  upon  scaling  the  distribution  function  that  is  simulated  by  EMC, 
rather  than  upon  a  weighting  the  EMC  simulation  of  an  unsealed  distribution  function.  The 
choice  of  s  determines  the  trajectory  density.  In  principal,  these  two  approaches  may  be 
equivalent  in  particular  cawses.  However,  a  weighted  EMC  elimiinates  the  usual  identification 
between  individual  initial  particles  and  individual  trajectories  sampled  (in  what  is  the  Monte 
Carlo  integration  of  the  Boltzmann  equation).  As  a  result,  one  loses  the  intuitive  simplicity  of 
regarding  sampled  trajectories  as  individual  particles  of  a  large  ensemble.  In  a  scaled  EMC,  on 
the  other  hand,  one  preserves  a  one-to-one  conespcxidence  of  initial  condition  to  trajectory,  and 
it  remains  possible  to  regard  the  trajectories  sampled  as  the  actual  paths  of  individual  particles. 
As  we  describe  below,  however,  in  order  to  redistribute  the  statistical  sampling  weight,  one 
pays  the  price  that  the  trajectories  do  not  foUow  the  paths  of  ordinaxy  particles. 


III.  Time>dependent  scaling 

Particle-number  conservation  imposes  an  important  constrain|^on  how  s(H(x),t)  is  allowed  to  be 
chosen.  By  appropriate  norm^zation,  the  total  number  N  of  simulation  particles  in  the 
simulation  of  j  is  the  integral  of  the  scaled  distribution  function: 

m  =  J  ?(x)  dx  =  J  s  (x)/(x)  dx  .  (5) 

If  s  is  chosen  to  emphasize  high-energy  regions  which  have  low  density,  then  during  a 
relaxation,  thermalization  will  cause  a  transfer  of  (real)  electrons  to  lower-energy  regions  where 
s  is  smaller.  If  s  is  Mt  allowed  a  time-dependence  to  compensate,  the  total  number  of 
simulation  partictes  of  /  must  decrease  —  degrading  tire  statistics  in  /.  Conversely,  a  heating 
mechaxtism  would  increase  the  number  of  simulation  particles,  improving  statistical  precision 
but  possibly  requiring  undesirable  computational  expense.  By  allowing  s  to  have  a  time- 
dependence,  we  acctnnplish  in  the  time  domain  what  the  energy  ^pendence  of  s  accomplishes 
for  the  energy  domain:  reduce  variations  in  particle  number  so  that  fractional  errors  can  be  kept 
at  an  acceptable  level  throughout  die  region  of  interest,  widi  the  least  computational  effmt. 

We  chose  a  simple  form  for  the  joint  energy-and-time  dependence  of  s:  we  let  them  be 
independent  factors.  This  can  be  written 

s(//,r)  s  exp(a(r)+7(//))  •  (6) 

Furthermore,  we  let  the  function  y(H)  »-H  /  ksT*^.  This  is  appropriate  for  distibutions  / 
which  are  approximately  characterized  by  effectiw  tempmtures  below  T^“  [Higher 
temperatures  lead  to  normalization  problems  with  /.]  This  choice  is  also  convenient 


computationally:  a  linear  function  y  implies  that  inelastic  scattering  rates  as  well  as  elastic  rates, 
are  ^ace-positkw-independent,  and  can  be  stored  efficiendy  in  look-up  tables. 


J  then  obeys  a  modified  Liouville  equation 


(7) 


in  which  the  “collision”  or  scattering  term  is  defined  by 


(8) 


Equations  (2)  and  (8)  specify  completely  the  modified  scattering  term  (d^/dr)co{i.  However,  in 
order  to  implement  a  Monte  Carlo  time-evolution,  one  must  determine  scattenng  rates  for  the 
scaled  problem  which  are  analogous  to  the  out-scattering  rates  r(x;r)  and  the  in-scattering  rates 
r(x,x';r)  of  the  unsealed  problem.  There  is  some  freedom  in  way  this  is  done.  One  well- 
known  degree  of  freedom  is  associated  with  self-scattering: 


r(x;t)  -¥  r(x;f)  +  Ar(x;r) 

.  r(x,x';r)  ->  r(x,x';/)  +  Ar(x;r)  8(x-x') 


where  ordinarily  AT  is  chosen  to  make  the  total  out-scattoing  a  positive  constant.  This  is  a  kind 
of  gauge  transformation,  in  which  the  physically-significant  toud  scattering  rate  is  fixed,  while 
unobservable  components  of  the  in-  and  out-scattering  rates  make  the  numerical  implementation 
tractable.  A  more  general  land  of  gauge  transformation  is  made  in  the  scaled  EMC  approach, 
leading  to  off-diagonal  in-scattering  rates  of  the  form 

^(x,x')  =  G(x,x')  +  G(x)  for  *  ^  (10) 

N 


The  second  term  on  the  right-hand  side  leads  to  a  kind  of  attractive  interparticle  scattering.  This 
performs  a  tfile  similar  to  that  of  trajectory  iteration  in  weighted-EMC  approaches:  simulation 
particles  entering  critical  regions  are  given  greater  weight,  and  are  effectively  caused  to  perform 
multiple  traversals.  However,  in  SEMC  tUs  weighting  is  implemented  smoothly,  rather  than 
abrupdy  at  the  boundary  of  a  region  of  interest,  and  it  is  accomplished  with  a  fixed  number  of 
particles  undergoing  essentially  ordinary  scattering. 


IV.  Simulations 

We  have  applied  the  SEMC  technique  to  bulk  GaAs  semiconductor  at  300  K.  We  used 
parameters  (deformation  potentials,  phonon  energies,  band  structure,  etc.)  that  have  been 
confirmed  empirically  in  imvious  simulations  [2].  We  have  specifically  neglected  hot  phonon 
effects  and  Coulomb  scattering,  so  we  have  essentially  modeled  the  electrons  in  intrinsic  GaAs 
with  weak  laser  excitation.  system  was  flowed  to  relax  for  10  ps  from  the  initial  laser 
excitation,  and  the  result  plotted  below  were  obtained  as  an  average  of  the  distribution  function 
during  tire  last  OJ  ps  (i.e.,  an  average  was  perform^  of  results  of  the  last  100  S-ps  observation 
times).  Because  of  the  averaging  procechire  used,  tiiere  is  a  kind  of  local  averaging  of  the 
density,  so  that  the  standard  deviation  of  the  results  from  a  smooth  fit  underestimates  the 
uncertainty  of  the  simulation  results. 

Figure  1  shows  the  density  as  a  function  of  energy  of  electrms  in  the  central  (F)  valley.  An 
ordirury  (unsealed,  unwei^ted)  EMC  simulation  is  ernnpared  with  the  scaled  simulation  using 
T^^  s  400  K.  Both  of  the  simidations  use  the  same  (constant)  number  of  simulation  particles: 
10^.  In  this  valley,  the  SEMC  simulation  appears  to  be  less  smooth  (i.e.,  to  have  greater 
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statistical  uncertainty)  at  low  eneipes.  This  is  in  fact  conect:  the  price  we  pay  for  increasing  the 
numbo’  of  simulation  points  at  higher  energies  is  to  reduce  the  number  at  low  energies,  wirh 
correspondingly  poorer  statistics.  At  higher  energies,  when  the  densities  become  smaller  than 
10~^,  the  usual  EMC  becomes  completely  unreliable,  since  results  correspond  to  single 
simulation  particles  in  a  bin.  The  results  of  the  two  EMC  simulations  are  consistent  (in  the 
sense  that  their  difference  is  not  statistically  significant),  as  they  must  be,  since  they  are 
mathematically  equivalent  in  the  large-N  litniL  The  equivalence  at  high  energies  is  clearly  due  to 
the  large  error  bars  in  the  unsealed  EMC  (not  shown)  which  completely  cover  the  range  of 
densities  to  zero.  There  is  the  appearance  that  the  unsealed  EMC  predicts  a  systematically 
higher  ^nsity  at  high  energy,  since  all  of  the  high-energy  points  plotted  are  above  the 
corresponding  SEMC  points  plotted.  There  are  two  reasons  for  this  appearance.  One  is  the 
local  (time)  averaging  discussed,  which  has  the  effect  of  smoothing  a  random  statistical 
fluctuation  into  an  apparent  systematic  one.  The  other  is  that  on  the  semilogaiithmic  scale,  the 
points  where  EMC  p^cts  zero  density  (and  lie  below  the  SEMC  simulation)  cannot  be  plotted. 
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Figure  1.  Central  conduction  valley  population  density. 


Figures  2  and  3  show  the  same  comparison  as  figure  1,  for  the  L-  and  X- valleys,  respectively. 
The  thresholds  at  zero  kinetic  ener^es  correspond  to  satellite  band  minima  many  times  kfiT 
above  the  F-valley  minimum,  and  so  the  regime  of  potn*  statistes  is  reached  more  quickly.  The 
SEMC  in  each  case  shows  die  smooth  exponential  f^-edf  of  a  neariy  equilibrated  system. 


V.  Conclusion 

We  have  shown  that  a  scaled  EMC  simulation  can  be  implemented  to  significantly  improve  the 
statistics  over  broad  ranges  of  low  real  densities. 
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Abstract 

The  rates  of  electron  scattering  by  acoustic  phonons  in  low  dimensional  structures 
are  calculated.  It  is  shown  that  due  to  uncertainty  of  momentum  conservation  in  low 
dimensional  systems  the  acoustic  phonon  scattering  becomes  essentially  inelastic  in  con¬ 
trast  to  that  in  bulk  materials.  We  propose  simple  procedure  for  incorporation  of  inelastic 
acoustic  phonon  scattering  into  a  Monte  Carlo  technique.  It  is  demonstrated  that  electron 
low-field  mobility  at  temperatures  less  that  100  K  calculated  within  elastic  approxima¬ 
tion  is  greatly  underestimated.  We  have  also  demonstrated  that  velocity-field  dependence 
in  QWIs  has  a  superlinear  region.  This  superlinear  region  is  associated  with  decline  in 
acoustic  phonon  scattering  efficiency  as  the  electrons  get  heated. 

I.  Introduction 

Low  dimensional  (LD)  semiconductor  structures  are  now  widely  recognized  as  a  very 
promising  basis  for  future  technological  applications.  The  electron  transport  in  LD  struc¬ 
tures  at  low  temperatures  is  controlled  by  acoustic  phonon  scattering.  However,  so  far 
there  is  a  considerable  gap  in  understanding  of  some  essential  aspects  of  LD  electron 
scattering  by  acoustic  phonons.  The  elasticity  of  electron-acoustic  phonon  scattering  is 
a  commonly  used  approximation  [1*4].  A  closer  look  at  this  mechanism  shows  that  elec¬ 
tron  scattering  by  acoustic  phonons  in  quantum  structures  becomes  essentially  inelastic 
and  becomes  effective  mechanism  of  energy  dissipation  [5].  This  is  due  to  the  fact  that 
the  momentum  conservation  in  LD  structures  for  electron-acoustic  phonon  systems  is  not 
preserved. 

We  present  results  of  calculations  of  electron  scattering  by  acoustic  phonons  in  ID 
quantum  wires  (QWIs)  and  2D  quantum  wells  (QWs),  where  we  consider  inelasticity  of 
this  scattering  in  full  detail.  We  develop  procedure  for  inclusion  of  this  inelasticity  in¬ 
to  a  Monte  Carlo  technique.  Monte  Carlo  simulation  results  are  obtained  for  GaAs  QWIs. 

II.  Scattering  Rates 

The  rate  of  electron  scattering  from  the  state  k  to  the  state  k'  with  the  assistwce  of 
deformation  acoustic  phonon  with  wave-vector  q  is  given  by 

+  (1) 

where  V  is  the  principal  volume  of  the  crystal,  E^  is  the  deformation  acoustic  potential,  p 
is  the  density  of  the  material,  u^  =  uq  it  the  long  wave  approximation  of  acoustic  phonon 
dispersion,  where  u  is  a  sound  velocity  in  the  material,  and  Ac  is  the  energy  separation 


( 


between  initial  and  final  subbands.  Here  and  everywhere  below  upper  sign  corresponds  to 
emission  and  lower  sign  to  absorption  of  acoustic  phonon.  We  will  consider  a  rectangular 
quantum  structure  with  infinitely  deep  potential  well  embedded  into  another  material  with 

similar  elastic  properties  so  that  acoustic  phonons  can  penetrate  through  the  interface  ( 

between  these  materials.  The  major  difference  between  bulk  and  LD  electron  scattering 
comes  from  the  overlap  integral  I  =  Integrating  over  the  volume  of  the 

quantum  structure  and  neglecting  umklapp  processes  we  get 


G'l./'.s.  = 


\IlD?  = 

2  [l  -  cos(g,L,)] 


+  f‘)  +  »‘(P  -  I’’)’]’ 


{ 

(2) 
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where  n  is  subband  index  for  a  QW;  j  and  I  are  subband  indices  for  a  QWI  due  to  size 
quantization  in  y  and  z  directions,  respectively.  Primed  indices  denote  the  subband  of  the 
final  state.  The  Kroenecker  delta-function  reflects  momentum  conservation  in  unquan¬ 
tized  direction(s).  The  form-factor  G  is  responsible  for  the  uncertainty  of  momentiun 
conservation  in  perpendicular  to  the  quantum  structure  direction(s). 

The  total  scattering  rate  from  the  state  k  of  the  initial  subband  to  any  state  of  the 
final  subband, 

W  =  (2^  E k'.q).  (4) 

The  complexity  of  the  form-factor  does  not  allow  one  to  get  analytical  results  without 
certain  simplifications.  Let  us  consider  now  separately  QWs  and  QWIs.  For  numerical 
estimates  we  use  parameters  of  GaAs  quantum  structure  embedded  in  AlAs  at  T=30  K. 


(a)  Quantum  Wells 

The  scattering  rate  of  2D  electrons  by  acoustic  phonons  obtained  from  (7)  for  electrons 
on  subband  n  with  wave  vector  k||  scattered  to  subband  n'  can  be  rewritten  in  the  following 
form: 


An,n'(l5||)  — 


ElmkeT 

2L,ph^u^ 


(5) 


where  ^n,n'(k||)  is  a  dimensionless  fimction  of  the  order  of  unity,  given  by 


i|  -  *||Co«(v) 


1-1 


kjj  -I-  -  2*||fcf|Cos(v>)  +  9,* 


9{k\\,fP,qz,n,n!) 


(6) 


We  use  the  following  notation  in  formula  (6)  :  ^  is  an  angle  between  electron  wave  vector 
before  scattering  k||  and  electron  wave  vector  after  scattering  k||.  An  absolute  value  of  the 
in-plane  electron  wave  vector  after  scattering,  k[|,  is  a  solution  of  the  energy  conservation 
equation.  The  function  A/’(z)  =  z/(e”  —  1)  -h  z/2  ±  z/2  in  eq.  (6)  is  a  dimensionless 
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function  Mwociated  with  phonon  occupation  number,  is  an  integer- value 

function  which  is  equal  to  a  number  of  the  roots  of  the  energy  conservation  equation. 
For  the  elastic  approximation  huq  — »  0  we  obtain  the  following  expression: 


Jn.n'(*^|)  =  (1  +  -  Cf»'  +  ■^)> 


where  6(z)  is  a  unit  step-function.  The  elastic  approximation  fails  in  the  region  of  low 
electron  energy  which  amount  for  1  —  5  meV  for  GaAs  QWs.  Mathematical  complexity  of 
the  problem  does  not  allow  one  to  solve  it  analytically.  Figure  1  demonstrates  the  energy 

dependence  of  the  acoustic 


phonon  emission  rate  in  three 
lowest  subbands  (intra8ubb2md 
transitions)  of  the  100 A  thick 
QW  for  T=30  K,  obtained 
by  direct  numerical  solution  of 
eqs.  (5)  and  (6).  Instead  of 
single-step-wise  function  which 
follows  froir  '  :  elastic  approx¬ 
imation,  the  emission  rate  is  a 
smooth  function  of  electron  en¬ 
ergy.  The  complex  structure  of 
the  form-factor  for  transitions 
within  upper  subbands  results 
in  several  plateau  regions  in  the 
energy  dependence  of  the  emis¬ 
sion  rate. 


(b)  Quantum  Wires 

Let  us  assume  that  q,  C  qj,  where  qr  =  ^qj  +  q*  is  a  transverse  component  of  phonon 
wavevector  and  that  huq  kaT.  To  meet  both  above  assumptions  it  is  reqmred  that 
<  kaT,  where  qrmas  =  -i-  j'YIL\  +  (/  +  VYIL\  '\s  roughly  the  maximum 

transverse  component  of  phonon  wavevector  where  the  form-factor  is  non-zero.  Then  after 
summation  of  eq.  (7)  over  k'  and  integration  over  q.  we  get 


t  dqydq,  G 

4ir*ptt*h*V  2  J  y/e  ^  httqr  —  Ac 


(7) 


Here  G  stands  for  Gjji^  integration  limits  for  absorption  rate  are  imposed 

by  the  form-factor  which  tends  to  zero  above  qrfnw  The  uncertainty  of  momentum 
conservation  is  roughly  defined  by  hqrfnw,  i.e.,  it  increases  with  decreasing  the  transverse 
dimensions  of  the  QWI.  As  the  electron  energy  tends  to  zero  the  absorption  rate  saturates 
since  qrmM  is  net  equid  to  zero. 

Let  us  consider  the  asymptote  of  very  large  electron  energies  e— Ac  >  huqrntam-  In  this 
case  we  may  neglect  phonon  energy  (elastic  approximation).  Note  that  for  this  asymptote 
emissi<m  and  absorption  rates  are  equal.  Then  (7)  turns  into: 


A(e) 
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(8) 


Thus  the  Ivge  energy  asymptote  diverges  as  function  of  electron  kinetic  energy 
after  scattering,  the  same  as  ID  density  of  states  function.  The  large  energy  asymptote  is 
extended  to  low  energy  region  in  [1-4]  to  evaluate  electron  mobility.  The  scattering  rate 
within  this  elastic  approximation  is  inversely  proportional  to  a  QWI  cross  section  L^xL,. 

Figure  2  demonstrates  the  d- 
ifferencies  between  the  scat¬ 


tering  rate  dependencies  on 
electron  energy  calculated  for 
QWI  with  cross  section  250  x 
150 A’  within  three  different  ap¬ 
proaches:  elastic  approxima¬ 
tion  of  eq.  (8),  the  approach 
of  eq.  (7),  and  the  direct 
numerical  integration  of  eq. 
(4)  without  any  simplification- 
s.  The  correct  inelastic  treat¬ 
ment  of  acoustic  phonon  scat¬ 
tering  yields  qualitatively  dif¬ 
ferent  scattering  rates.  Emis¬ 
sion  and  absorption  rates  with¬ 
in  the  inelastic  approach  are 
considerably  different  at  low 


energies. 

There  is  no  divergency  of  the  scattering  rates  within  inelastic  approach.  The  energy 
given  by  uncertainty  of  momentum  conservation  defines  the  position  of  the  max¬ 
imum  on  emission  curve  and  coincides  with  the  characteristic  energy  of  acoustic  phonon 
interacting  with  electroiu. 


III.  Results  of  the  Monte  Carlo  Simulation 


The  necessity  to  allow  for  inelasticity  of  acoustic  phonon  scattering  in  Monte  Carlo 
simulations  complicates  the  procedure  of  choice  of  the  final  states  for  scattered  electron. 
We  have  developed  very  efficient  procedure  for  random  selection  of  acoustic  phonon  energy 
involved  in  the  scattering.  The  essence  of  this  procedure  is  that  we  first  ntunerically 
perform  von  Neumaim  procedure  for  a  set  of  random  numbers  and  tabrilate  the  phonon 
energy  as  a  function  of  a  random  number.  For  a  QWI  we  have  solved  the  following 
equation: 

dqTF{(tT)  ,  (9) 

with  respect  to  9}  for  a  set  of  100  random  values  of  r  ran^ng  from  0  to  1.  Here  the  function 
F{qj-)  >*  w  integrand  of  eq.  (7)  and  represents  the  scattering  probability  dependence  on 
the  transverse  components  of  the  phonon  wavevector.  Since  97  is  directly  related  to  the 
phonon  energy  within  approximation  9.  <.  97^,  one  can  find  the  desired  phonon  energy 
for  each  value  of  r.  A  table  of  such  values  has  been  calculated  for  a  set  of  electron  energies 

^Within  mm  accnratc  ^qnoach  whm  9.  is  not  negloctsd,  the  acoustic  phonon  energy  is  uniquely 
related  to  fc,  so  that  this  dtepeadence  can  be  used  to  generate  the  acoustic  phonon  energy  [5]. 
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for  emiMion  and  absorption  of  acoustic  phonon  separately.  Then  the  choice  of  phonon 
energy  involved  into  a  scattering  event  is  just  the  generation  of  random  number  r  and 
the  selection  of  the  corresponding  phonon  energy  value  from  the  appropriate  table.  This 
procedure  essentially  speeds  up  the  Monte  Carlo  simulation.  ( 

To  test  the  scattering  rates  and  the  Monte  Carlo  procedure  we  have  simulated  the 
electron  transport  at  low  lattice  temperatures.  Monte  Carlo  simulations  yield  very  accu¬ 
rate  thermal  equilibrium  energy  and  distribution  function  for  electron  system  indicating 
that  acoustic  phonon  scattering  is  treated  correctly.  The  elastic  scattering  model  fails  to 
yield  good  equilibrium  energy  and  distribution  function.  Figure  3  demonstrates  low  field  ( 

electron  mobility  in  the  QWI 
with  cross  section  40  x  40A^  as 
a  function  of  lattice  tempera¬ 
ture.  One  can  see  that  elec¬ 
tron  mobility  calculated  within  ( 

elastic  approach  is  significemtly 
tmderestimated  in  the  low  tem¬ 
perature  region.  The  reason  for 
this  is  obvious:  an  overestima¬ 
tion  of  acoustic  phonon  scat-  i 

tering  rate  (especially  emission 
rate)  at  low  energy  region.  It 
must  be  noted  that  in  thicker 
QWIs  the  discrepancy  between 
low  field  electron  mobilities  ob-  ( 

tained  within  elastic  and  inelas¬ 
tic  models  is  less  pronounced 
because  inelasticity 

We  have  also  observed  superlinear  region  on  velocity-field  dependence  associated  with  4 

reduction  of  acotistic  phonon  scattering  efficiency  as  the  electron  system  gets  heated.  This 
superlinear  region  appears  only  for  rather  thick  QWI.  In  thin  QWIs  (cross  section  of  the 
order  of  40  x  40A^ )  the  acoustic  phonon  scattering  is  so  strong  and  inelastic  that  it  prevents 
electron  heating  up  to  very  high  electric  fields.  The  superlinearity  on  velocity-field  depen¬ 
dence  is  greatly  overestimated  within  elastic  approximation  since  elastic  scattering  does  ( 

not  prevent  electron  runaway  from  the  low  energy  region.  As  a  restilt,  the  velocity-field 
dependences  calculated  within  elastic  and  inelastic  approaches  are  considerably  different. 
Acknowledgement:  This  work  is  supported  by  the  NSF  and  by  ARO. 
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Highly  Efficient  Full  Band  Monte  Carlo 
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Abstract 

We  present  a  full  band  monte  carlo  algorithm  based  on  phase-space  sim- 
plexes  which  has  all  of  the  advantages  of  analytical  band  monte  carlo  while 
preserving  the  accuracy  of  a  full  band  structure.  An  adaptive,  contour-aligned 
grid  algorithm  is  used  to  represent  the  energy  band  structure  within  the  ir¬ 
reducible  wedge  and  is  calculated  once  for  each  semiconductor  band.  The 
complexity  of  generating  the  adaptive  grids  in  phase-space  is  more  than  com¬ 
pensated  for  by  the  simplicity  and  efficiency  introduced  into  the  equations  of 
motion  and  final  state  selection,  which  are  treated  exactly  within  the  physics 
of  the  model.  Consequently,  our  method  confines  simulation  error  to  statisti¬ 
cal  error  and  a  consistently  bounded  discretization  error  determined  only  by 
the  chdice  of  phase-space  grid  and  can  be  set  to  a  user-defined  tolerance.  Re¬ 
sults  using  this  method  show  at  least  an  order  of  magnitude  improvement  in 
performance  over  previous  full  band  codes. 

Monte  Carlo  simulation  has  earned  an  important  place  in  semiconductor  transport 
simulation  because  it  offers  a  practical  way  of  solving  the  full  Boltzmann  equation 
-  essential  for  a  rigorous  understanding  of  non-stationary /high-field  transport  which 
includes  impact  ionization,  hot  carrier  dynamics,  and  velocity  overshoot.  Naturally, 
the  more  rigorously  based  the  physics  in  the  Monte  Carlo  simulation,  the  more  accu¬ 
rate  and  detiuled  the  solution.  The  numerical  solution  of  the  Boltzman  equation  is 
obtained  by  integrating  particles  along  trajectories  in  phase  space.  The  momemtum 
of  each  particle  is  abruptly  changed  when  scattering  events  occur. 

One  of  the  most  important  elements  in  establishing  an  accurate  physical  model 
is  the  inclusi<Hi  of  a  full  band  structure  (FB)  which,  for  example,  can  be  calculated 
from  empirical  pseudo-potential  or  local  density  functional  theory.  Unfortunately  the 
inclu8i<m  of  a  full  band  structtire  considerably  complicates  the  MC  simulation.  First, 
the  particle  velocity  is  not  simply  related  to  k  but  is  determined  by  the  gradient 
the  energy  band.  Fulure  to  integrate  the  equations  of  motion  exactly  will  result 
is  unphysical  gains  or  loses  in  the  particle  energy.  Secondly,  the  scattering  rates 
conserve  energy  which,  for  simple  dispersionless  phonon  models,  restricts  the  final 


state  momentum,  P,  to  lie  on  energy  iaosurfaces.  For  a  general  band  structure,  these 
surfaces  may  be  extremely  complica^  requiring  elaborate  searches  of  the  Brilluoin 
Zone  for  the  final  state  momentum,  k'. 

The  computation  complexity  of  MC  simulations  may  be  significantly  reduced  if 
an  analytic  expression  is  used  for  the  band  structure.  Isotropic  polynomial  approx¬ 
imations  to  the  energy  bands  enable  efficient  and  accurate  calculations  of  the  free 
flight  trajectories  and  final  states  momentum  selection.  Analytical  bands  (AB)  are 
rigorously  justified  for  low  field  transport  since  the  carriers  remain  close  to  the  band 
minimum,  where  the  bands  are  parabolic.  Although  AB  models  are  computationally 
efficient,  accuracy  is  sacrificed,  at  high  energies,  where  simple  band  approximations 
are  difficult  to  construct. 

We  propose  a  FB  MC  algorithm  [2,  3],  based  on  simplexes,  which  has  the  com¬ 
putational  advantages  of  the  faster  AB  simulators,  while  preserving  the  physics  con¬ 
tained  in  the  full  band  structure.  In  k-space,  the  irreducible  wedge  is  represented  by 
a  contour-aligned  tetrahedral  grid.  An  adaptive  algorithm  is  developed  to  approxi¬ 
mate  each  energy  band  by  a  piecewise  linear  polynomial  to  a  pre-defined  accuracy. 
Within  each  element,  the  equations  of  motion  may  therefore  be  integrated  exactly, 
thus  conserving  energy  at  each  time  step.  Furthermore,  the  energy  isosurfaces  for  the 
entire  Brilluoin  Zone,  may  be  readily  calculated  providing  an  accurate  final  k-state 
selection.  As  a  result,  the  simulation  error  to  the  statistical  error  inherent  in  MC, 
and  a  consistently  bounded  discretization  error  deriving  from  the  choice  of  grid  in 
phase-space. 

Our  algorithm  represents  phase  space  as  a  cartesian  product  of  two  simplex  grids, 
Tk  X  Tg.  In  k-space,  only  the  irreducible  wedge,  defined  by  the  48  elements  of  the 
point  group  pertaining  to  cubic  semiconductors  is  triangulated.  A  rotation  matrix, 
Q,  is  assigned  to  each  particle  rotates  the  position  in  the  irreducible  wedge  to  its 
proper  place  in  the  full  Brilluoin  zone.  In  both  spaces,  piecewise  linear  polynomial 
approximations  are  tised  to  represent  the  electrostatic  potential  and  each  energy  band. 

Since  the  equation  of  motions  are  linear,  particle  trajectories  may  be  computed 
exactly.  Pointers  to  adjacent  elements  are  used  to  facilitate  motion  through  the  grid. 
Neighbor  pointers  are  also  constructed  at  the  phase  space  boundaries,  where  particles 
may  be  either  transmitted  <x  reflected.  Ellement  pointers  at  reflected  surfaces  point 
to  themselves,  where  as  transmitted  particles  reenter  the  phase  space  grid  at  some 
other  point.  In  either  case,  the  simulation  particles  remain  confined  to  the  solution 
domain  and  enough  informati<m  is  retained  to  reproduce  the  entire  phase  space. 

The  selection  of  the  final  state  momentum  is  also  exactly  computed  for  piecewise 
linear  polynomial  ^proximations.  Since  is  a  constant,  the  constant  energy  sur¬ 
faces  are  defined  by  the  intersecticm  of  a  plane  with  the  tetrahedron.  An  efficient, 
single  parameter,  quadratic  interpolation  procedure  may  be  developed  when  the  ver¬ 
tices  of  the  tetrahedron  are  constrained  to  lie  between  two  constant  energy  surfaces. 
Each  tetralMdra  classified  by  the  number  of  vertices  lying  on  an  pven  energy  contour, 
Ei.  The  density  of  states  is  then  characterized  by  the  single  interpolation  parameter. 
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^£),  and  ^ven  by 
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II 

(1) 

V^^^iE)  =  -  d) 

(2) 

V^"’\E)  = 

(3) 

where  ^E)  =  (E  —  E, •)/(£, +1  —  Ej)  and  Ei  <  E  <  Ei+i.  The  major  advantage  of  this 
decomposition  is  that  global  quantities,  obtained  by  summing  over  tetr2diedra,  may 
be  interpolated  in  the  same  manner.  The  selection  of  the  final  state  momentum  is 
readily  computed  by  proceeding  down  a  k-space  tree.  The  rotation  matrix,  Q,  is  first 
determined,  then  the  tetrahedra  type,  and  finally  a  single  tetr2diedron  containing  the 
final  state.  In  addition  to  two  uniformly  distributed  random  variables  are  sufficient 
to  determine  the  barycentric  coordinates  within  the  tetrahedron.  The  final  state 
momentum  is  then  computed  by  a  linear  interpolation  of  the  tetrahedron  vertices, 

Since  the  the  trajectory  calculations  and  the  selection  of  the  final  state  momentum 
are  performed  exactly,  the  only  numerical  error  encountered  in  the  solution  of  the 
discretized  BTE  is  the  statistical  error  associated  with  sampling  the  phase  space. 
This  error  is  bounded  during  each  monte  carlo  run  by  monitoring  the  fluctuations 
in  computed  quantities.  However,  a  second  source  of  numerical  error  arises  from 
approximating  the  continuous  BTE  with  a  discrete  set  of  equations.  Complete  control 
of  the  numerical  error  is  obtuned  by  constructing  meshes  that  provide  a  consistent 
bound  on  the  discretization  error. 

We  use  an  adaptive  grid  algorithm  to  construct  meshes  that  represent  functions  in 
both  k-space  and  x-space  to  a  desired  accuracy.  These  algorithms  are  based  upon  esti¬ 
mating  the  discretization  error,  e,  in  approximating  continuous  functions  by  piecewise 
linear  polynomials.  In  x-space,  simple,  inexpensive  error  estimates  are  constructed 
directly  from  Poisson’s  equation,  [1].  These  estimates  provide  both  lower  and  upper 
bounds  to  the  true  discretization  error,  0  <  Ci|{|e|||  <  |||0  —  0fc|||  <  C3|||c|||  For 
k-space  grids,  the  discretization  eircu'  is  computed  as  deviations  of  the  true  band  en¬ 
ergies  from  piecewise  linear  approximations.  The  meshes  are  continually  refined  or 
unrefined  until  ||c||  <  S  where  £  is  a  user  specified  error  tolerance.  By  controlling 
the  discretizaticm  error  of  these  phase  space  grids,  we  can  insure  that  numerical  error 
associated  with  our  MC  simulations  is  bounded. 

Several  considerations  are  necessary  to  produce  an  acceptable,  contour-aligned, 
tetrahedral  grid  tiw  our  monte  carlo  simulations.  Unlike  conventional  adaptive  refine¬ 
ment  algorithms,  eadi  grid  edge  must  lie  dther  on  an  contour  or  connect  two  adjacent 
omtours.  Mesh  refinement  is  accomplished  by  adding  new  energies  contours  rather 
than  simply  refining  element  edges.  Sharp  cusps  are  formed  in  the  higher  energy 
bands,  due  to  band  crosrings,  which  produce  discmitinuous  changes  in  VE(k).  These 
features  rignificantly  effect  the  accuracy  of  any  pdynominal  approximation  to  the  en¬ 
ergy  bands.  As  with  any  mesh  generation  code,  the  grid  should  omsist  of  reasonably 
shaped  tetrahedra. 


226 


’4 


Critical  points  and  ridge  lines  play  an  essential  role  the  production  of  accurate 
contour-aligned  grids.  The  role  of  critical  points,  k  such  thatV£(ib)  =  0,  is  twofold. 
Local  extrema  make  excellent  vertices,  especially  for  coarse  piecewise  linear  approxi¬ 
mations.  Also,  saddle  points  describe  how  components  of  contours  split  and  connect 
[4,  9,  8, 10].  The  role  of  ridge  lines,  extrema  in  the  curvature  of  the  isosurface,  is  used 
to  decompose  the  space  between  two  contour  surfaces.  When  a  ridge  line  connects, 
say,  a  minimum  to  an  adjacent  saddle  point,  the  “watershed”  regions  between  ridge 
lines  are  empirically  observed  to  provide  a  good  starting  decomposition. 

The  initial  grid  is  generated  from  a  skeleton  of  each  energy  b£md.  The  skeleton 
consists  of  a  set  of  ridge  lines  and  critical  point  energies  that  divide  the  wedge  into 
several  regions.  Within  each  region,  vertices  are  generated  along  contours  that  ad¬ 
here  to  both  the  wedge  boundaries  and  ridge  lines.  Curved  region  boundaries  are 
represented  by  piecewise  linear  line  segments  with  sufficient  resolution  to  capture  the 
geometry  of  the  boundary.  Contour-aligned  grids  are  produced  by  refining  grid  edges. 
Edges  that  cross  more  then  one  contour  are  divided  by  adding  a  point  at  the  midpoint 
contour  energy.  Ekiges  that  lie  on  a  contour  are  refined  either  to  reduce  the  discretiza¬ 
tion  error  or  to  improve  the  quality  of  the  resultant  tetridiedra.  After  all  edges  are 
divided  at  most  once,  the  grid  is  then  retriangled  producing  a  new  set  of  tetrahedra. 
The  overall  accuracy  of  the  contour-aligned  grids  is  controlled  by  selectively  adding 
new  energy  contours  when  necessary. 

The  lowest  seven  silicon  energy  bands  were  computed  using  the  nonlocal  pseudo¬ 
potential  of  Chelikowsky  and  Cohen  [5]  .  The  first  3  bands  correspond  to  holes 
while  the  last  4  bands  correspond  to  electrons.  To  easily  accommodate  interband 
transitions,  a  single  set  of  energy  contours  was  calculated  for  all  bands.  A  error 
tolerance  oiSk  =  0.02£(ib)  was  used  in  generating  all  grids.  The  low  energy  spectrum 
was  further  selectively  refined  to  produce  an  error  if  6  =  0.002ev  near  the  minimum 
energy.  As  shown  in  Table  1  ,  accurate,  high  quality,  contour-aligned  tetrahedral 
grids  and  be  obtuned  with  relatively  few  grid  points.  The  grid  for  the  lowest  electron 
band  in  the  x-y  plane  of  the  irreducible  vwdge  is  shown  in  figure  1. 

The  performance  of  our  algorithm  is  compared  with  several  existing  monte  carlo 
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Table  2:  Performance  Comparison 


Code 

Band 

Structure 

CPU  Time  (YMP)  || 

Time  Step 

Total 

IBM 

Full 

5.0* 

100.0* 

Illinois 

Full 

1.5 

25.0 

BEBOP 

Analytic 

1.0 

2.0 

MMC 

Full 

1.0 

1.0 

*  estimated 

c*.des  in  Table  2.  The  test  example  consisted  of  simulating  10000  electrons  in  a 
constant  electric  field  of  lO^V/cm  for  a  simulation  time  of  1  picosecond.  To  obtain  a 
fair  comparison,  the  full  band  Illinois  code  [11],  the  analytic  band  BEBOP  code  [12] 

,  and  our  full  band  code,  MMC,  were  run  on  the  same  machine,  a  Cray  YMP.  The 
CPU  times  for  IBM  monte  carlo  code  [7]  were  taken  from  published  data  [6].  These 
timings  are  qualitative  since  a  different  problem  was  solved  on  a  different  machine. 
Large  time  steps  may  be  taken  for  our  simulations  and  BEBOP,  since  the  equations 
of  motions  and  the  final  state  energy  selection  are  computed  to  the  machine  precision. 
Small  time  steps  for  the  other  full  band  codes  are  necessary  to  integrate  the  equations 
of  motion  to  a  reasonable  numerical  accuracy  due  to  the  general  band  structure.  The 
significant  increase  in  CPU  time  of  the  IBM  code  as  compared  to  the  Illinois  code  is 
largely  attributed  to  the  exhaustive  time  spent  in  the  finaJ  state  momentum  selection. 
By  any  measure,  our  code  combines  the  rigorous  physics  of  full  band  simulations  with 
the  computational  efficiency  of  analytical  band  models. 

A  computationally  efficient,  full  band  structure,  monte  carlo  algorithm  has  been 
developed.  The  algorithm  is  based  on  a  simplex  decomposition  of  phase  space,  and 
has  all  of  the  advantages  of  analytic  band  simulators,  while  preserving  the  physics 
contained  in  the  full  band  structure.  The  work  required  for  free-flight  computations 
as  well  as  final  state  selection  is  considerably  reduced  through  the  use  of  piecewise 
linear  approximations  to  both  the  energy  and  the  electrostatic  potential.  This  method 
treats  motion  in  k-space  and  x-space  symmetrically  allowing  exact  integrations  of  the 
equations  of  motion  and  final  state  selection. 

In  k-space,  the  irreducible  wedge  is  represented  by  a  contour-aligned  tetrahedral 
grid.  An  adaptive  grid  algorithm  is  used  to  generate  a  mesh  that  approximates  the 
energy  bands  to  a  pre-defined  SK;curacy.  Furthermore,  the  equal-energy  surfaces  may 
be  readily  calculated  providing  an  accurate  final  k-state  selection.  As  a  result,  the 
simulation  error  is  reduced  to  the  monte  carlo  statistical  error  and  a  consistently 
bounded  discretization  error  deriving  from  the  choice  of  grid  in  phase-space. 
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Abstract 

An  ionised  impurity  scattering  model  for  Monte  Carlo  calculations  is  proposed.  This 
model  includes  the  following  four  corrections  to  the  simple  Brook-Herring  model:  1) 
momentiun  dependent  screening,  2)  the  second  Bom  term  for  single  impurity  scattering,  3) 
dressing  effect  of  impurities  on  electron  wavevectors,  4)  scattering  from  pairs  of  impurities. 
Monte  Carlo  calculations  are  performed  for  majority  electrons  in  silicon  at  300K  and  77K 
with  donor  concentrations  from  10^*cm~^  to  10^cm~’.  Calculations  show  this  model 
provides  a  substantial  improvement  to  models  based  on  the  first  Bom  approximation. 


I.  Introduction 

Monte  Carlo  techniques  have  become  a  popular  method  to  simulate  the  characteristics  of 
modem  semiconductor  devices.  In  these  device  simulation  programs,  an  ionised  impurity 
scattering  model  based  on  the  first  Bom  approximation  is  commonly  used[l,2].  For  many 
materials,  such  as  silicon  and  GaAs,  modeb  based  on  the  first  Bom  approximations  over¬ 
estimate  mobility  for  concentrations  greater  than  10^*cm~^  at  room  temperature[3].  Witn 
ionised  impiirity  concentrations  in  many  semiconductor  devices  greater  than  lO^’cm"^, 
an  improved  impurity  scattering  model  for  Monte  Carlo  calculations  is  clearly  needed. 
Recently  Kay  and  Tang[4]  proposed  an  improved  model  for  Monte  Carlo  calculations. 
Their  model  included  phase-shift  analysis  and  multiple  impurity  scattering  where  they 
calibrated  their  model  to  fit  experimental  data.  Fischetti[5]  showed  excellent  quantitative 
agreement  with  experimental  data  for  both  n  and  p  doped  silicon  at  300K  by  including 
plasmon  scattering  to  phase-shift  analysis  and  short-range  carrier-carrier  scattering.  An¬ 
other  approach  which  is  becoming  more  popular  when  simxdating  coulombic  interactions 
is  Molecular  Dynamics.  Ferry  and  workers[6]  obtained  excellent  agreement  with  experi¬ 
mental  results  for  GaAs  at  77K.  Molecular  Dynamics  has  the  disadvantage  of  requiring 
large  amounts  of  CPU  time. 

In  this  work  we  include  four  corrections  to  the  simple  Brooks-Herring  impurity  scattering 
model  and  incorporate  these  in  an  Ensemble  Monte  Carlo  program.  These  corrections 
include  momentum  dependent  screening,  dressing  effects  on  the  electron  wavevectors,  the 
second  Bom  approfximation,  and  multiple-inq)urity  scattering.  Mobility  is  calculated  for 
majority  electrons  in  silicon  at  3(K)K  and  77K  for  impurity  concentrations  ranging  from 
10**cm~*  to  10*’cm~*. 
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n.  Electron  lYansport  Model 

The  electron  tranaport  model  contains  all  six  non-parabolic  ellipsoidal  z  valleys.  Three 
intervalley  p-type  phonons  between  parallel  valleys  and  three  intravalley  /type  between 
perpendicular  valleys,  elastic  acoustic  phonon  scattering,  electron-electron  scattering  and 
impurity  scattering  are  considered.  The  phonon  scattering  and  material  parameters  given 
by  Brunetti  and  coworkers[7]  and  the  phonon  scattering  rates  calctUated  by  Jacoboni  and 
Reggiani(8]  are  used  in  this  work.  Electron-electron  scattering  rates  take  into  account  the 
ellipsoidal  nature  of  the  conduction  band  as  developed  by  Osman  and  coworkers[9].  The 
impurity  scattering  model  is  discussed  in  the  next  section. 


ni.  Electron-Impurity  Scattering  Model 

A  review  of  the  procedure  used  to  implement  corrections  to  the  basic  Brooks-Herring 
imptirity  scattering  model  is  given  in  this  section.  A  more  complete  description  will  be 
published  elsewhereflO]. 

As  discussed  in  the  introduction,  four  corrections  to  the  standard  Brooks-Herring  impurity 
scattering  model  are  implemented  in  our  Monte  Carlo  program.  The  standard  Debye 
screening  length  is  modified  by  including  a  momentum  dependent  correction  using  the 
equation  developed  by  Chung  and  Ferry[ll].  Momentum  dependent  screening  results  in  a 
scattering  rate  which  is  dependent  upon  the  scattering  angle  6.  For  a  screened  coulombic 
potential  based  on  the  first  Bom  approximation  the  scattering  rate  becomes 


S(k,*)  =  fl(k) 


where  n(k)  is  the  wavevector  dependent  constant  normally  seen  in  the  Brooks-Herring 
scattering  rate  equation.  This  scattering  rate  cannot  be  calculated  in  its  present  form, 
since  the  state  after  scattering  is  not  known  prior  to  the  scattering  event.  This  is  dealt 
with  in  Monte  Carlo  calculations  by  using  a  large  correction  to  the  inverse  screening  length 
resulting  in  a  scattering  rate  with  internal  self-scattering.  When  impurity  scattering  is 
chosen,  the  scattering  angle  is  selected  using  a  rejection  method.  Once  the  scattering 
angle  is  known,  it  is  then  determined  whether  impurity  or  self-scattering  terminated  the 
particle’s  free  flight. 

The  three  other  corrections  are  developed  following  a  procedure  similar  to  Moore[12]  which 
is  also  described  by  Langer[13]  and  Rickaysen[14].  The  basic  procedure  is  to  expand  the 
self  energy  and  define  a  dressing  function  as  the  real  part  and  a  width  function  as  the 
imaginary  part  of  the  self-energy  tenns[15].  In  this  work,  we  included  the  first  two  terms 
of  the  self  energy  for  single  impurity  scattering  which  can  be  thought  of  as  the  first  and 
second  Bom  terms  and  the  first  term  for  scattering  by  a  pair  of  impurities.  The  dressing 
effect  on  the  electron  wavevectors  is  included  only  on  the  first  single  impurity  term.  The 
dressing  effect  on  electron  energy  was  small  compared  to  other  the  corrections,  this  was 
also  observed  by  Moore[16],  and  is  not  included. 

When  the  dressing  effect  of  the  impurities  on  the  electron’s  wavevector  is  included,  the 
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Figure  1:  Majority  EUectnui  Mobility  as  a  function  of  ionised  impurity  concentration  for 
silicon  at  300K 


scattering  rate  based  on  the  first  Bom  term  becomes 


,(k)  = 


1- 


mnV  (3/9*  +  4k*) 
SirfiV^  ^*  +  4k*)*. 


i-a 


(2) 


where  5m  i*  the  scattering  rate  using  the  Brooks-Herring  model,  n  u  the  impurity  con¬ 
centration,  m  is  the  electron  mass,  e  is  electron  charge,  e  b  the  material  constant,  and  li 
b  Planck’s  constant.  If  momentum  dependent  screening  is  used,  5m  b  given  by  equation 
(1)  and  b  incorporated  in  the  same  manner. 

The  scattering  rate  for  the  second  Bom  term  b  pven 

tan-*  ( 


nm*e* 


l/» 


sin(d/2)(l-t-y-l-4x)(H-4x) 


(3) 


where  *  =  ^  sin*(d/2)  and  y  =  4k* //9*.  Equation  (3)  b  int^rated  numerically  over  i 
to  obtain  the  scattering  rate  and  a  rejection  method  b  used  to  determine  the  scattering 
angle  0.  As  observed  by  other  researdiers[16],  the  correction  to  mobility  by  the  second 
Bom  term  b  large  in  region  where  the  first  Bom  term  alone  b  valid  k*/^*  »  1.  Thb  b 
dealt  with  by  setting  the  scattering  rate  for  the  second  Bom  term  to  sero  when  the  first 
B<»n  term  alone  b  mlid. 

The  multi-particle  transition  rate  b  found  1^  considering  only  the  lowest  approximation 
for  scattering  from  a  pab  of  impurities.  Thb  b  the  same  form  as  developed  by  Moore[12] 
and  Ridmysen(14].  The  resulting  scattering  rate  b  quite  oom]dex  and  will  not  be  pven 
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Figure  2:  Majority  Electron  Mobility  aa  a  function  of  ionized  impurity  concentration  for 
silicon  at  300K 

here.  Multiple  impurity  scattering  is  incorporated  in  the  same  manner  as  the  second  Bom 
term. 

IV.  Results 

Figures  1  and  2  show  a  comparison  of  the  basic  Brooks>Herring  model,  the  model  dis¬ 
cussed  in  this  PM>er,  and  experimental  data[17]  for  silicon  at  300K  and  77K,  respectively. 
As  figure  1  shows,  the  proposed  model  clearly  improves  upon  the  Brooks-Hening  impurity 
scattering  model  and  compares  favorably  with  experimental  data  over  the  range  of  con¬ 
centrations  considered.  Figure  2  shows  the  discrepancy  between  the  Brook-Herring  model 
and  experimental  data  widens  as  the  temperature  is  decreased.  Again,  the  proposed  model 
clearly  improves  upon  the  Brooks-Herring  model. 

V.  Conclusion 

We  have  proposed  an  improved  impurity  scattering  model  for  Monte  Carlo  calculations. 
The  improved  model  indudes  four  corrections  to  the  Brooks-Herring  impurity  scatter¬ 
ing  modd.  Results  for  i»-doped  silicon  at  300K  and  77K  show  o\ir  modd  substantially 
improves  mobility  calculations  compared  to  noodeb  based  on  the  first  Bom  approximation. 
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Abstract 

We  present  a  numerical  study  of  the  non-equilibrium  properties  of  quasi-one  dimensional 
systems  with  different  confining  geometries.  Carrier  coohng  following  a  laser  photoex¬ 
citation  is  analyzed  by  means  of  a  Monte  Carlo  simulation  which  includes  all  relevant 
scattering  mechanisms.  Our  results  indicate  that,  for  wire  made  available  by  todays 
technology,  the  carrier  dynamical  properties  of  these  systems  are  not  so  different  with 
respect  to  bulk  GaAs. 

I.  Introduction 

The  recent  advances  in  compound  sexniconductor  technology  have  allowed  the  fabrica¬ 
tion  of  wire-like  regions  where  quasi  ID  confinement  is  achieved  in  a  narrow-band-gap 
semiconductor  material  surrounded  by  another  large-band-gap  semiconductor  [1].  In 
putictilar,  such  wire-like  structures  have  been  fabricated  with  rectangular  and  V  shaped 
cross  sections  whose  dimensions  are  comparable  to  the  carrier  coherence  length  [2,3]. 
In  such  structures,  carrier  transport  characteristics  are  expected  to  differ  from  the  bulk 
case  as  the  carrier-phonon  scattering  rate  is  affected  by  the  changes  in  the  electronic  eis 
well  as  in  the  vibrational  properties  induced  by  layering  [4]. 

Much  of  the  theoretical  work  dedicated  so  far  to  one  dimensional  structures  has 
covered  mainly  the  so  called  “extreme  quantum  limit”,  i.e.  the  case  in  which  only  one 
subband  is  occupied  [1,5].  For  wire  structures  available  today,  such  hmit  can  only  be 
reached  under  extremely  low  temperature  and  electric  field  conditions.  Usually,  several 
subbands  are  present,  and  their  contribution  to  carrier  transport  has  to  be  taken  into 
account. 

The  aim  of  the  present  contribution  is  to  extend  the  analysis  of  charge  transport 
in  quasi  one-dimensional  systems  to  nonequihbrium  conditions  emd  nonhnear  regimes, 
by  studying  realistic  wire-like  structures  where  many  subbands  are  normally  occupied. 
This  is  performed  by  means  of  an  Ensemble  Monte  Carlo  simtilation  which  allows  the 
study  of  dynamical  evolution  of  photoexcited  carriers,  a  situation  often  achieved,  for 
instance,  in  ultrafast  optical  experiments.  By  using  the  electronic  levels  and  wave 
functions  as  derived  from  an  exact  solution  of  Schrodinger’s  equation,  we  evaluate  the 
electron-phonon  and  electron-electron  scattering  rates  which  are,  in  turn,  used  in  the 
simulation  to  study  the  carrier  relaxation  process.  In  addition,  the  method  allows  us  to 
easily  include  effects  like  degeneracy  and  non-equilibrium  phonons. 


We  will  investigate  two  different  type  of  GaAs-based  structures,  namely  rectangular 
wires  obtained  from  multi  quantum  well  samples  by  chemical  etching  [2,6]  and  V-grooved 
wires  obtained  by  growing  a  thin  quantum  well  on  an  etched  V-shaped  profile  [7]. 

lI.Electronic  states  and  electron*phonon  interaction 

Chemical  etching  gives  GaAs-based  rectangtdar  quantum  wire  surrounded  along  one 
of  the  confinement  directions  by  AlGaAs  layers,  and  and  free-standing  along  the  other 
confinement  direction  [6].  In  this  case,  the  conduction  band  electrons  can  be  reasonably 
described  by  a  factorized  envelope  function  of  the  type  ^ ,  where  ipix)  amd 

^(y)  are  respectively  the  solution  of  the  one  dimensional  Schrodinger  equation  in  a 
rectangular  potential. 

The  situation  is  more  complicated  for  the  V-grooved  structure.  There,  in  order 
to  study  the  energy  levels  and  associated  eigenfunctions  of  a  two-dimensional  V-like 
profile,  we  consider  the  following  single-particle  Schrodinger  equation 

+  V{x,y)^ifix,y)  =  £il>{x,y)  (1) 

where  denotes  the  two-dimensional  Laplace  operator  and  V{x,y)  the  two- 
dimensional  potential  profile.  By  considering  the  following  set  of  two-dimensional 
planewaves  over  a  rectangular  domain  (1: 


*(k.,k,;l,!,)  =  .  (2) 

the  Schrodinger  equation  (1)  can  be  rewritten  as 

A  4kUk 

where  the  coefficients  c(kj,kJ,)  are  the  Fourier  component  of  the  total  wavefunction 
V’(z,y)  and  V(kz,ky;kx,ky)  are  the  matrix  elements  of  the  potential  profile  in  the 
plane-wave  basis  given  in  Eq.  (2).  By  means  of  a  standard  numerical  procedure,  we 
derive  the  eigenvalues  S  corresponding  to  the  energy  levels  and  the  Fourier  coefficients 
c(k^,ky)  of  the  corresponding  eigenfunction.  Once  such  coefficients  are  known,  we 
obtain  for  each  energy  level  S  the  corresponding  eigenfunction  according  to  V’(z,t/)  = 
2k'  k'  numerical  results  have  been  obtained  for  the  case  of  a 

V-like  potential  profile  shown  in  Fig.  1.  The  V-like  region  is  caracterized  by  a  potential 
Vi  and  all  the  surrounding  region  by  a  potential  V2.  Fig.  2  shows  a  plot  of  the  carrier 
density  corresponding  to  the  first  four  electronic  levels.  As  expected,  the  ground  state 
exibits  a  single  maximum  in  the  center  of  the  V-like  region,  while  the  excited  states 
extend  over  the  whole  V-groove  exhibit  an  increasing  number  of  m2ixima  as  the  order 
of  the  level  increases. 

Starting  from  the  electronic  wave  functions  and  energy  levels  just  described,  we  have 
computed  the  electron-phonon  scattering  rate  for  the  two  structures,  assuming  in  both 
cases  bulk  LO  phonon  modes.  This  approximation  should  be  qtiite  acceptable  in  these 


')=£(k.,ky),  (3) 
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Fig.  2.  Electron  charge  density  in  the  V-grooved  quantum  wires  for  the  first  four 
energy  levels. 
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systems,  which  have  a  large  spatial  extension,  due  to  the  recent  findings  that  the  overall 
electron-LO  phonon  scattering  rate  in  quantum  wells  does  not  differ  significantly  from 
the  bulk  one  once  the  contribution  of  confined  and  interface  phonon  modes  is  properly 
accounted  for  [8].  Figures  3  (a)  and  (b)  show  respectively  the  scattering  rate  for  the 
rectangular  quantum  wire  and  the  V-grooved  wire  (solid  lines)  [9],  compared  with  bulk 
GaAs  rate  (dashed  line).  In  the  V-grooved  structure,  only  the  first  five  energy  levels 
are  accounted  for.  Apart  from  the  different  position  of  the  energy  levels  in  the  two 
quantum  wires,  the  behavior  is  very  similar  and  the  overall  scattering  rates  in  both 
wires  is  sinoilar  to  that  of  the  bulk. 
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Energy  (eV)  Energy  (eV) 

Fig.  3.  Electron-phonon  scattering  rate  for  (a)  a  rectangular  cross  section  quantum 
wires  and  (b)  a  V-grooved  quantum  vdres  (solid  lines),  compared  with  the  bulk 
scattering  rate  (detshed  line). 

III.  Simulation  results 

As  the  scattering  rate  for  the  V-groove  and  the  rectangular  wire  are  similar  we  per¬ 
formed  the  similation  only  for  the  rectangular  wire.  We  espect  that  the  results  will  be 
qualitatively  the  same  for  the  V-groove  wire.  Our  Monte  Carlo  simulation  accounts  for 
the  electron-polar  optical  phonon  as  well  as  the  electron-electron  interaction  for  both 
intrasubband  and  intersubband  transitions.  The  model  used  here  is  similar  to  that  de¬ 
scribed  in  ref.  10  but  the  effect  of  non-equilibrium  phonons  have  been  accounted  for 
with  a  rejection  technique.  The  study  of  the  effect  played  by  the  hot  phonon  popu¬ 
lation  is  more  difficiilt  in  this  system,  compared  to  the  bulk  case,  due  to  the  lack  of 
isotropy.  The  phonon  population  has  been  collected  using  a  three-dimensional  mesh. 
The  cooling  of  the  excited  carriers  following  a  laser  excitation  is  shown  in  Fig.  4  (a), 
(b)  and  (c)  respectively  for  the  one  dimensional  densities  of  10*  cm~^,  3  x  10*  cm~^ 
and  10*  cm~^.  As  already  verified  [11]  the  cooling  in  the  quantum  wire  (solid  line)  is 
slower  compared  to  the  bulk  (dashed  line)  due  to  the  reduced  intersubband  scattering 
rate,  but  this  effect  is  modified  by  the  presence  of  the  hot  phonon  population.  At  very 
low  densities,  i.e.  when  the  phonon  population  is  not  driven  out  of  equilibrium,  there  is 
a  big  difference  in  the  cooling  between  the  wire  and  the  bulk.  Vice  versa  at  the  higher 
density  this  difference  is  strongly  reduced.  This  effect  can  be  clearly  attributed  to  the 
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reduced  efficency  of  hot  phonon  reabsOTption  in  the  quantum  wire  caused  by  the  spatial 
anisotropy  oi  the  s]rstan. 


t  (ps)  t  (ps)  t  (ps) 

Fig.  4.  Electron  mean  energy  after  a  laser  photoexdtation  in  a  quantum  wire 
(solid  lines)  for  three  different  densities:  (a)  10®  cm~^,  (b)  3  x  10®  cm“^  and  (c) 
10*  cm~^,  compared  with  the  equivalent  results  in  bulk  GaAs  (dashed  lines). 


In  conclusion,  we  have  shown  that  the  Monte  Carlo  method  can  be  easely  applied 
to  the  study  of  low  dimensional  systems  described  by  any  type  of  geometrical  potential 
confinement.  The  dynamical  properties  of  the  carriers  can  be  invesitgated  in  strongly 
non-linear  and  non-equilibrium  conditions  with  any  number  of  occupied  subbands.  Our 
results  show  that,  for  the  relatively  large  wire  made  available  by  the  actual  technology, 
the  difference  between  the  behaviour  of  these  system  and  a  bulk  system  is  much  lower  of 
that  predicted  and  probably  only  in  systems  extremely  narrow  a  faster  carrier  transport 
will  be  found. 
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Abstract 

A  2D  multilayer  MOSFET  simulator  has  been  developed,  using  self-consistently  coupled 
ensemble  Monte  Carlo  and  2D  Finite  Element  Poisson  Solver  algorithms.  The  simulator  is 
used  to  investigate  the  operation  of  sub-micron  delta-doped  MOSFETs,  in  order  to  assess  their 
suitability  for  high  density  logic  applications.  An  operating  window  is  observed  within  which 
the  delta-doped  devices  exhibit  a  significant  reduction  in  surface  carrier  density,  relative  to 
conventional  MOSFETs,  implying  reduced  surface  scattering  and  gate  injection. 

I.  Introductioii 

In  order  to  achieve  Ultra  Large  Scale  Integration  (ULSI)  densities  in  CMOS,  the  gate  lengths 
of  silicon  MOSFETs  must  be  reduced  well  below  1pm.  It  is  well  known  that  sub-micron 
MOSFETs  are  prone  to  a  number  of  short  channel  effects:  increased  lateral  electric  Helds 
result  in  enhanced  hot  carrier  generation,  whilst  increased  vertical  fields  pull  carriers  hard 
onto  the  Si/Si02  interface  -  lea^g  to  increased  interface  scattering  and  hot  carrier  injection 
into  the  Si02  layer.  The  latter  effect  results  in  an  accumulation  of  trapped  charge  in  the  oxide, 
which  causes  eventual  device  failure  by  shifting  the  threshold  voltage  and  reducing  the  ability 
of  the  gate  to  modulate  charge. 

These  problems  may  be  alleviated  by  using  a  delta  doped  MOSFET  structure  [1,2],  in  which 
conduction  occurs,  not  at  the  Si/Si02  surface,  but  in  an  ultrathin,  highly  doped  layer  located 
typically  200-400A  below  the  interface.  Thus,  both  interface  scattering  and  hot  carrier 
injection  should  be  reduced:  however,  realistic  modelling  work  is  necessary  to  determine  the 
efficacy  of  the  delta  layer  in  confining  carriers,  its  effect  on  the  surface  carrier  density  and 
energy  distribution,  and  the  extent  of  any  consequent  reduction  in  hot  carrier  degradation. 

In  this  paper,  we  describe  the  development  of  a  2D  self-consistent  Monte  Carlo  simulator,  and 
its  application  to  modelling  electronic  transport  in  delta-doped  MOSFETs.  In  very  short  gate 
devices  transport  can  no  longer  be  described  in  terms  of  steady  state  mobilities  and  saturation 
velocities,  and  the  traditional  drift-diffusion  sqiproacb  to  device  modelling  becomes 
ina^uate.  The  Monte  Carlo  method  is  well  established  as  a  powerful  technique  for 
modelling  highly  non-equilibrium  transport  ptenomena,  and  can  yield  a  detailed  microscopic 
insight  into  the  (^ration  of  semiconductor  devices.  The  initial  investigation  presented  here 
concerns  the  basic  issues  of  transistor  operation  and  carrier  distribution  in  short  gate  delta 
defied  MOSFETs,  and  the  effect  of  the  delta-dr^ied  layer  on  the  surface  electron  density. 


II.  The  Slmubition  Program 


We  have  developed  a  high  specification  2D  simulation  program  for  sub-micron  multilayer 
MOSFETs.  The  program  consists  of  an  ensemble  Monte  Carlo  algorithm  self-consistently 
coupled  to  a  2D  Finite  element  Poisson  Solver.  The  2D  Poisson  equation  is  re-solved  every 
Ifs:  such  a  short  timestep  is  necessary  to  avoid  undersampling  of  the  plasma  oscillations 
which  occur  in  the  highly  doped  source  and  drain  implants  of  silicon  MOSFETs. 

The  Finite  Element  mesh  can  be  defined  by  the  user,  but  is  restricted  to  a  rectangular  grid 
format.  This  restriction  greatly  simplifies  the  determination  of  the  2D  charge  density  in  the 
device  from  the  distribution  of  Monte  Carlo  particles.  The  ability  to  specify  non-uniform 
mesh  spacings  is  essential  for  both  delta-doped  and  conventional  MOSFETs,  in  order  that  the 
potential  at,  in  the  former  case,  the  delta-doped  layer,  and,  in  the  latter  case,  the  surface 
inversion  layer,  may  be  accurately  modelled. 

Electronic  transport  in  the  MOSFETs  is  modelled  using  a  nonparabolic,  ellipsoidal  description 
of  the  6  silicon  X-valleys  Intervalley,  and  acoustic  intravalley  phonon  scattering  is  included 
using  the  parameter  set  recommended  by  Brunetti  et  al.[3],  which  was  found  to  produce  good 
agreement  with  experimental  drift  velocity  data.  Impurity  scattering  is  included  via  the 
Brooks-Herring  model. 

The  effect  of  degeneracy  (Pauli  exclusion)  is  modelled  using  the  approach  described  by  Lugli 
and  Ferry  [4],  in  which  the  probability  that  an  electron  is  scattered  into  a  state  of  energy  E  is 
weighted  by  l-f(E),  where  f(E)  is  the  local  electronic  distribution  function.  In  our  simulation, 
the  distribution  function  may  be  sampled  in  up  to  8  different  spatial  regions  -  including  the 
source  and  drain  implants,  and  sections  along  user  defined  surface  and  buried  layers  -  to 
account  for  spatial  variations  in  electron  density  and  carrier  heating. 

In  order  to  model  self-consistently  the  electrostatic  potential  at  the  pn  junctions  between  the 
implants  and  the  channel  region,  a  2D  hole  density  is  included  in  a  zero  current  (fixed  quasi 
Fermi  level)  approximation,  as  described  by  Fischetti  and  Laux  [5].  A  damping  scheme  is 
employed  to  ensure  a  smooth  convergence  to  the  fixed  quasi  Fermi  level  condition.  The  effect 
of  the  hole  density  on  the  electrostatic  potential  is  particularly  important  for  very  short  gate 
devices,  where  the  acceptor  density  must  be  increased  in  order  to  combat  drain  induced  barrier 
lowering  and  punchthrough. 

A  particle  replication  scheme  [6]  is  utilised  to  increase  the  sampling  capability  of  the 
simulator  in  the  channel  region  of  the  devices.  For  a  replication  factor  M,  any  particle 
entering  the  channel  region  is  replicated  M-1  times.  Particles  attempting  to  enter  an  implant 
region  are  annihilated  with  a  probability  (M-l)/M.  Correspondingly,  all  particles  in  the 
implant  regions  have  an  associated  charge  M  times  greater  than  those  in  the  channel  region. 
The  source-drain  current  is  calculated  by  summing  the  x-component  of  velocity  for  all 
particles  located  between  the  source  and  drain  implants  [7].  This  method  enables  a  much 
larger  sample  to  be  used  than  in  the  traditional  approach  of  counting  particles  emitted/injected 
at  the  source  and  drain  contacts. 

Several  authors  have  modelled  surface  scattering  in  MOSFETs  by  allowing  both  specular  and 
diffusive  reflection  of  particles  which  impinge  on  the  oxide  interface  [8].  Our  program  allows 
any  proportion  of  diffusive  reflections  to  be  specified;  however,  for  this  particular 
investigation,  we  have  used  specular  reflections  only.  The  extent  of  surface  scattering  is 
primarily  important  in  determining  the  relative  speed  of  delta-doped  and  conventional 
MOSFETs,  an  issue  which  will  be  the  subject  of  a  separate  investigation.  For  the  same  reason 
we  have  not,  at  this  stage,  included  any  sophisticated  description  of  impurity  scattering  in  the 
delta-doped  layer;  such  refinements  may  readily  be  made,  as  and  when  required. 


The  program  rxins  for  typically  2-5  hours  per  bias  point  on  an  HP  710  workstation.  The  actual 
run  time  depends  on  the  device  size  and  bias  conditions,  with  simulations  of  shorter  devices 
generally  taking  less  time.  Access  to  a  large  networixd  cluster  of  woricstations  at  Newcastle 
means  that  a  simulations  for  a  whole  set  of  bias  points  can  be  run  in  parallel,  on  the  same 
timescale. 

m.  Results 

We  have  modelled  a  0.1pm  delta-doped  MOSFET  with  the  following  specification.  The  delta 
doping  dose  was  lO^^cm'^  -  implemented  as  a  20A  doping  plane,  with  a  bulk  doping  density 
of  5xl0l*cm‘3.  The  delta-doped  layer  was  located  200A  below  the  Si/Si02  interface.  A 
doping  density  of  2xl0l^cm'3  was  used  for  the  source  and  drain  implants:  whilst  this  value  is 
lower  than  those  encountered  in  some  MOSFET  structures,  it  was  found  sufficient  to  define  a 
flat  potential  across  the  implant  region  -  hence  providing  a  suitable  model  of  an  ohmic 
contact.  The  implant  depth  was  taken  as  500A,  and  a  length  of  lOOOA  of  each  implant  was 
included  in  the  simulation.  Again,  this  length  was  found  sufficient  to  model  the  ohmic  contact 
and  the  surrounding  potential.  The  use  of  higher  implant  doping  densities,  or  the  inclusion  of 
a  greater  length  of  implant,  leads  to  large  increases  in  CPU  time,  with  no  significant  gain  in 
physical  information  or  quantitative  accuracy. 

Short  gate  MOSFETs  can  suffer  from  high  leakage  currents  due  to  drain  induced  barrier 
lowering.  The  effect  can  be  alleviated  by  increasing  the  substrate  doping  and,  in  common 
with  previous  reports  on  0.1pm  conventional  MOSFETs,  we  have  specified  a  doping  density 
of  1O17c0i-3  for  the  p-type  silicon  substrate.  Threshold  voltage  shifts  are  also  a  problem  in 
short  gate  FETs:  we  have  chosen  an  oxide  thickness  of  50A,  in  order  to  reduce  the  gate 
voltage  swing  required  to  turn  the  device  off.  Assuming  an  n-type  polysilicon  ohmic  contact, 
our  simulations  predict  a  threshold  voltage  of  around  -l.SV  for  the  device  under 
investigation.  Obviously,  the  threshold  voltage  varies  with  the  delta  layer  depth  and  dose.  We 
have  used  a  drain  bias  of  IV  in  our  simulations,  reflecting  the  reduced  supply  voltage 
anticipated  for  use  in  ULSI. 


Figure  1.  Steady  state  election  distribution:  (a)  Vg=0V,  Vp=lV  (b)  Vq=0.75V,  Vj5=lV 


Figures  1(a)  and  (b)  show  instantaneous  plots  of  the  2D  electron  distribution  in  the  device, 
during  steady  state  operation,  for  gate  biases  of  OV  and  0.7SV.  Figure  1(a)  shows  conduction 
occurring  well  below  the  oxide  interface;  however,  it  is  clear  that  the  delta  layer  does  not 
provide  strong  carrier  confinement,  with  electrons  travelling  in  a  very  broad  current  channel. 
On  the  other  hand,  in  figure  1(b),  for  Vq  =  0.75V,  an  inversion  layer  has  formed  at  the 
interface;  the  device  is  now  operating  primarily  in  a  surface  channel  mode.  This  type  of 
transition  has  also  been  observed  in  drift-di^sion  modelling  of  long  channel  devices  [9]. 


Figure  2.  Conduction  band  edge  profile  vs.  vertical  distance,  with  4000A  corresponding  to  the  oxide  interface. 
For  each  gate  bias,  a  set  of  six  slices'  of  the  potential  profile  are  shown,  taken  at  200A  intervals  along  the  channel 
from  souce  to  drain.  The  profiles  corresponding  to  the  endpoints  of  the  channel  are  pinned  close  to  the  source 
and  drain  biases  for  most  of  the  vertical  distance  shown,  whilst  the  other  four  profiles  appear  in  descending  order 
moving  along  the  channel  from  source  to  drain. 


Figure  3.  Electron  density  vs.  distance  from  source  to  drain:  (a)  at  the  oxide  interface,  (b)  along  the  delta-layer 


Figure  2  shows  several  sets  of  vertical  ID  slices  through  the  conduction  band  edge  profile, 
for  a  range  of  gate  biases.  In  this  figure  it  can  be  seen  that  the  potential  'notch'  caused  by  the 
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delta  layer  is  quite  shallow  and,  as  the  gate  bias  is  increased,  the  potential  at  the  oxide 

interface  drops  below  that  in  the  delta  layer,  along  most  of  the  channel  length.  Figures  3(a) 

and  (b)  show  the  electron  density  along  the  surface  layer  and  the  delta  layer,  for  the  same 

range  of  gate  biases.  Comparing  the  two  figures,  it  is  clear  that  the  surface  electron  density  i 

exceeds  the  delta  layer  density  for  all  the  gate  biases  shown  except  for  Vq  =  0.  This 

conclusion  is  consistent  with  the  potential  profiles  shown  in  figure  2,  which  also  indicate 

greater  surface  accumulation  for  Vq  >  OV.  At  Vq  =  OV  the  drain  current  for  this  device  is 

0.21mA  per  micron  gate  width.  For  comparison,  we  simulated  a  0.1  pm  conventional 

MOSFET  with  the  same  implant,  oxide  and  substrate  specification.  Approximately  the  same  { 

drain  current,  0.21mA/pm,  was  obtained  for  a  gate  bias  of  0.3V.  In  figure  4  we  have 

compared  the  surface  electron  densities  for  the  two  devices.  The  surface  electron  density  for 

the  delta  doped  device  is  less  than  one  third  of  that  in  the  conventional  MOSFET,  for  the  same 

drain  current.  An  equal  current  comparison  was  also  performed  for  the  Vq  =  0.25V  bias 

condition  on  the  delta  doped  device,  and  a  reduction  in  the  surface  electron  density  -  by  a  4 

factor  of  approximately  2.5  compared  to  the  conventional  device  -  was  still  found,  even 

though  the  delta-doped  MOSFET  showed  appreciable  surface  conduction. 


Figure  4.  Surface  electron  density  vs.  distance  from 
source  to  drain  for  a  0.1  um  delta  doped  MOSFET 
(dFET)  and  a  conventional  surface  channel  device 
(sEET) 


Obviously,  a  different  result  may  be  obtained  by  varying  the  depth  of  the  delta-layer  below  the 
oxide  interface.  We  have  carried  out  simulations  of  devices  with  delta-layer  depths  of  1(X)A, 
which  show  that  the  conducting  channel  spreads  up  to  the  oxide  interface,  even  when  no 
surface  inversion  layer  is  formed.  Conversely,  further  reductions  in  surface  carrier  density  may 
be  achieved  by  increasing  the  delta-layer  depth,  but  this  reduces  the  transconductance  and  the 
threshold  voltage:  beyond  a  certain  depth,  gate  control  can  be  lost  altogether  [9]. 

rv.  Conclusions 

We  have  used  a  2D  multilayer  MOSFET  Monte  Carlo  simulator  to  investigate  the  operation  of 
0.1pm  delta-doped  MOSFETs.  The  devices  have  an  operating  'window'  of  gate  biases  within 
which  the  conducting  channel  is  centred  on  the  delta-doped  layer.  In  this  mode,  a  substantial 
reduction  in  the  surface  carrier  density  can  be  obtained,  relative  to  conventional  MOSFETs, 
implying  consequent  reductions  in  surface  scattering  and  gate  injection.  Outside  this 
operating  window,  a  parasitic  surface  channel  is  formed,  and  the  above  advantages  are 
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diminished.  The  operating  window  is  lar^  en<High  to  be  compatible  with  logic  operation  at  a 
reduced  supply  voltage  of  IV. 
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Abstract 

As  semiconductor  devices  continue  to  shrink  in  size,  quantum  effects  are  begmning  to 
bectxne  important  in  didr  operation.  In  this  piq)er,  we  discuss  some  approaches  for  inclu^g 
the  quantum  effects  in  device  modeling. 


L  Introductkm 

The  nransport  of  carriers  in  semiconductors  and  in  ultra-small  semiconductor  devices  has  Icmg 

been  a  subject  of  much  interest,  not  only  for  material  evaluation,  but  also  in  the  realm  of  i 

device  modeling  and.  mote  importantly,  as  an  illuminating  tool  fw  delving  into  the  physics 

governing  the  interaction  of  elecmons  (or  holes)  witii  their  environment  The  scaling  of  ULSI 

(tevice  dunensions  to  future  chips  indicates  that  we  will  eventually  see  devices  with  gate 

lengths  at  the  0.05  pm  level.  Very  few  laboratorfos  have  inoduced  even  working  research 

transistors  with  gate  lengths  on  the  50  run  scale  and  little  is  urulerstood  about  the  limitations 

(from  the  physics)  that  will  determine  whether  or  not  these  devices  are  practical.  On  the  i 

other  hand,  when  devices  of  30  nm  (or  less)  gate  lengths  are  made,  it  is  found  that  their 

perfimnance  is  different  from  that  of  current  I%Ts.  Research  devices  with  gate  leng^  of 

25-80  nm  clearly  show  that  tunneUng  through  the  gate  depletion  region  is  a  major  contributor 

(if  not  the  dominant  contributOT)  to  current,  and  gate  control  is  much  redut^  due  to  this 

effect  [1-3].  Even  in  MOSFETs,  quantization  is  fcaind  to  occur  in  the  charuiel.  which  affects 

the  overall  performance  of  the  devices  [4].  In  consequence,  it  appears  that  noore  detailed  i 

modeling  of  quantum  dTects  needs  to  be  included  in  device  modeling  for  future  ultrasmall 

devices. 

There  are  several  approaches  that  have  been  used  to  model  quantum  effects  in  semiconductor 

devices  (of  varying  levels  of  device  complexity).  In  this  paper,  we  will  try  to  give  a  short 

overview  of  some  of  these  qrproaches,  and  indicam  how  th^  are  similar  and  how  they  differ.  i 

hi  the  next  section,  we  discuss  how  quantum  modeling  differs  from  semi-classical  modeling. 

We  then  turn  to  a  description  of  the  various  quantum  "distribution"  functions,  discuss  thw 
equations  of  motion,  and  the  levels  of  complexity.  In  each  case,  simple  examples  are 
described  where  the  approach  has  been  used  with  smne  effectiveness. 


n.  How  does  Quantum  Modeling  Differ  from  Classical  Modeling? 

(jenerally,  modeling  of  quantum  phenomena  is  xaon  cmiqiiicated  than  modeling  of  classical 
and/or  semi-classical  phenomena.  For  instance,  the  energy-cemserving  delta  fui^on  used  in 
computing  scattering  rates  with  the  Fermi  golden  rule  is  no  longer  valid,  as  energy  and 
momentum  become  sq»rate  dynamical  variables.  Thus,  we  are  forced  to  add  a  method  of  ^ 

confuting  the  spectra  density,  which  relates  the  energy  to  the  naomentum,  in  addition  to 
having  to  conqmte  non-equUiMum  distribution  fiinctions  (or  various  moments  of  these).  To 
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be  sure,  in  some  ^)proaches  this  is  finessed  by  using  single-time  functions,  such  as  the 
density  matrix  and  the  Wigner  distribution  function,  which  essentially  integrate  out  the 
spectral  function.  But,  this  is  accompanied  by  the  full  non-local  nature  of  the  potential 
interactions  becoming  explicit  in  the  dynamical  variables;  i.e.,  the  potential  becomes  a  two- 
point  function.  Let  us  consider  further  how  this  nonlocality  arises.  Consider  a  simple 
potential  barrier  V(x) »  Von(-x),  where  u(x)  is  the  Heavyside  st^  function.  We  assume  that 
there  is  some  density  existing  in  the  region  x  >  0,  and  the  question  is  how  the  density  varies 
near  the  barrier,  a  quite  typi(^  problem  in  introducmry  quantum  mechanics,  except  here  we 
have  a  statistical  mixed  state  to  describe.  In  Fig.  1,  we  show  the  Wimer  distribution  function 
for  this  case  (fOT  parameters  approimate  to  GaAs,  with  n  =  2x  10^'  cm'^)  for  Vq  -> «».  We 
note  that  far  from  the  barrier,  the  distribution  ^iproaches  the  classical  Maxwellian  form,  but 
near  the  barrier,  it  differs  greatly.  The  repulsion  from  tiie  barrier  is  required  by  the  vanishing 
of  the  wave  function  at  the  barrier,  but  the  first  peak  away  from  tire  barrier  (in  the  wave 
functitHi)  occurs  closer  to  the  barrier  for  higher  monrentum  states.  This  leads  to  much  of  the 
complication  evident  in  the  figure.  The  overshoot  occurs  to  accomodate  the  need  for  total 
charge  neutrality.  Classically,  in  the  absence  of  self-consistency,  the  density  would  be 
uniform  up  to  the  barrier,  and  the  differences  are  the  result  of  the  quantum  mechanics.  This 
variation  exists  over  a  distance  of  the  order  of  several  thermal  de  Broglie  wavelengths,  which 
provides  a  spatial  scale  length.  In  GaAs,  at  3(X)  K,  this  is  about  S  nm  for  electrons,  and  of 
course  increases  with  the  inverse  square  root  of  the  temperature  as  the  thermal  de  Broglie 
wavelength  is  given  by  Xd  =  Thus,  nonlo^  variations  can  be  expected  over 

a  range  of  10-20  nm  even  at  room  temperature! 


Figrue  1.  The  Wigner  distribution  function  for  an  infinite  barrier,  in  arbitrary  units  [5]. 


It  is  clear  that  the  density  no  longer  varies  simply  as  exp(-pv),  where  P  is  the  inverse 
temperature,  and  that  modifications  to  the  statistical  mechanics  need  to  be  made.  The 
developirrent  of  quantum  corrections  to  statistical  thermodynamics,  especially  in  equilibrium, 
has  a  rich  and  old  history.  Unfortunately,  there  is  no  consensus  as  to  the  form  of  the  quantum 
potential  correction  to  this  simple  exponential.  If  we  could  find  such  a  correction,  it  could  be 
utilized  in  the  semi-classical  hydrodynamic  equations,  and  most  further  quantum 
complications  ignored.  The  various  forms  of  the  quantum  potential,  for  use  in  classical 
hydr^ynamic  equations,  has  recently  been  reviewed  [6].  One  form  that  has  been  used  [7] 
introduces  a  quantum  pressure  term  as  a  modification  of  the  electron  temperature,  through 


=  1^7’e  - 


(1) 


although  other  work  has  reduced  the  last  term  by  a  factor  of  3  [8,9].  The  form  of  the  last 
term  in  (1)  was  originally  derived  by  us  and  is  often  termed  the  Wigner  potential  HO]. 
Although  the  results  obtained  using  this  model  are  in  agreement  with  the  intuitive 
expectations,  it  should  be  noted  that  the  correction  term  is  an  average  and  does  not  have  the 


momentum  dependence  expected  from  Fig.  1.  A  more  recent  derivation  overcomes  this 
limitation  [6],  but  has  not  bren  tested  in  actuid  simulations  as  yet 

Another  problem  with  the  use  of  quantum  approaches  to  device  modeling  is  that  most 
quantum  discussions,  especially  those  of  quantum  transport,  tend  to  revolve  around  closed 
systems,  whereas  most  devices  are  open  systems.  In  treating  such  open  systems  quantum 
mechanically,  it  is  quite  difficult  to  propi^y  define  the  reservoir  (thern^  equilibrium) 
regions,  as  well  as  tlw  contact  regions  between  the  reservoirs  and  the  active  ttevice  region. 
Because  of  the  nonlocal  nature  of  the  quantum  system,  errors  in  defining  the  contact  region 
will  propagate  throughout  the  device,  often  leading  to  spurious  results. 


m.  Approaches  to  Quantum  Distributions 

Why  don't  we  just  solve  the  SchrOdinger  equatitm  for  a  given  potential  distribution,  and  then 
weight  various  solutions  with  a  Fermi-Dirac  distrilMition?  This  q>proach  actually  wreks  well 
in  equilibrium  situations,  or  when  we  know  in  detail  the  properties  of  all  contact  regions  and 
the  exact  potential  structure  within  the  active  area.  However,  finding  the  steady-state 
solutions  with  the  above  approach  often  entails  mxe  work  than  using  one  of  the  techniques 
for  direcdy  finding  a  quantum  distribution  self-consistently  within  the  entire  device  domain. 
A  m(He  important  reason  lor  not  a  simple  Schrddinger  equation  solver  (plus  Fermi-Dirac 
weighting  factors)  for  a  distribution  is  that  the  actual  distribution  in  the  active  region  is  a  very 
non-equilibrium  ^stribution  which  we  must  find  as  part  of  the  modeling  problem.  But,  the 
approaches  discussed  below  are  develt^ied  from  the  Schrddinger  ^uation;  we  are  solving  a 
more  general  function,  which  incorporates  the  solution  of  this  equation  for  an  entire 
functional  set.  In  addition,  since  the  Schrddinger  equation  just  defines  a  wave  function, 
which  is  one  part  of  the  density  a  representation  for  an  electron),  it  is  quite  difficult  to 
incorporate  dissipation  through  scattering  mechanisms.  Nevertheless,  the  starting  point  for 
all  of  our  approaches  lies  in  a  mixed  state  wave  function  ^(x,t),  which  is  taken  to  be  a  field 
operator  describing  the  degree  of  excitation  of  the  various  states  of  the  system  (this  is  one 
method  of  conveniently  describing  the  mixed  state  of  the  system).  Depending  upon  the 
Hamiltonian,  this  wave  function  can  be  a  one-electron  wave  function  or  a  many-body  wave 
function. 


A.  The  Density  Matrix 

The  density  matrix  is  formed  from  the  composite  of  two  such  wave  functions  described 
above.  It  may  be  written  as 

p(xjt'.r)  =  'F(x,/)4«-(x',/) .  (2) 

where  the  "  symbol  on  the  second  wave  function  indicates  the  Hermitian  adjoint  function. 
This  is  an  equal  time  function  and  describes  the  correlation  between  events  at  positions  x  and 
x'.  Obviously,  <p(xjt,r)>  =  <^(x,/)4'+(x,r)>  =  n(x,t)  defines  the  local  density  of  particles. 
Here,  we  have  taken  an  expectation  of  the  density  operator,  since  the  definition  in  (2)  is 
obviously  that  of  an  operator.  The  equation  of  motion  arises  from  the  Liouville  equation.  It 
may  be  written  as  (in  the  ateence  of  dissipative  processes) 

where  the  last  term,  in  tiie  square  brackets,  is  a  shon-hand  notation  for 
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[8Uih(|8*V)V]  -  ^[V(R  +  |)  -  V(R-|)]  .  (4) 

and  we  have  introduced  the  cooidinate  tnuisfonnadons 

R  «  j(x  +  x')  .  s  »  a  -  *•  .  (5) 

The  density  matrix  has  been  used  direcdy  to  study  a  number  of  devices.  In  Fig.  2(a),  we 
show  the  density  matrix  for  a  double-barrier  resonant  tunneling  diode  (DBRTD)  in 
equilibrium.  Few  comparison,  we  also  show  die  Wigner  distribution  function  (described 
below)  for  this  structure  in  Fig.  2(b).  Both  calculadons  are  for  barriers  0.3  eV  5  nm 
wide,  and  separated  by  5  nm.  Both  are  within  lighdy  doped  regions  adjacent  to  the  barriers, 
S  nm  wide  for  the  Wigner  function  and  7.5  nm  wide  for  the  density  matrix.  In  bcMh 
calculations,  the  nominal  density  was  10^^  cm’^,  and  Fermi  statistics  were  applied  at  the 
boundaries.  For  the  density  matrix  in  Fig.  2(a),  this  is  lepie^ted  by  a  dandled  oscillation  in 
the  nonloosl  coordinate,  whose  period  decreases  as  the  drasity  increases.  Etensity  is  obtained 
from  the  diagonal  compment,  which  fev  the  DBRTD,  shows  a  small  buildup  of  charge  within 
the  quantum  well.  The  peak  of  this  charge  is  approximately  2  x  10^^  cm'^. 


Figure  2.  The  double-barrier  resonant  tunneling  diode  in  equililnium:  (a)  the  density  matrix 
[10],  (b)  the  Wigner  distribution  function  [13]. 

For  the  density  matrix  under  dyiiamic  current  flow  conditions,  dissipation  is  inemporated  and 
serves  to  couple  the  real  and  imaginary  parts  of  the  density  matrix.  Current  boundary 
cmiditicHis  are  reprinted  by  a  displaced  (Ustribution  function,  similar  to  a  displaced  Fermi 
distribution.  Dissipation  is  introduced  as  a  phenomenological  scattering  potential  whose 
diagcmal  components  have  the  properties  of  a  dynamic  quasi-Fermi  level  [11].  At  low  values 
of  bdas,  the  scattering  potential  has  the  fexm  (x  -  x')[J/cp(x,x)]p(x,x'),  which  is  similar  to  that 
discussed  in  [12].  Here  J  and  t  represent  current  arid  scattering  time,  respectively.  This  form 
of  scattering  conserves  the  total  number  of  particles.  Fot  simple  barriers,  the  current-voltage 
characteristics  display  the  expected  exponential  dependence  on  potential  energy,  with 
accumulation  at  the  emitter  side  of  the  barrier  and  depletion  on  die  coUector  side. 

The  cmnputational  procedures  are  described  in  detail  in  [12],  and  Ixiefly  may  be  described  as 
re-expressing  (3)  as  a  coupled  first-order  system  of  equations,  and  seeking  solutions  along 
characteristic  dioKtions  for  the  coupled  equations.  All  of  the  calculations  incorporate  equally 
spaced  grids,  and  a  coupled  Poisson  solver.  For  simple  barriers,  the  current-voltage 
characteristics  display  the  expected  exponential  dependence  on  potential  energy,  with 
accumulation  at  the  emitter  side  of  the  biurier  and  deletion  on  the  collector  side.  This  is 
shown  in  Fig.  3  for  a  0.3  eVbarrier,  15  nm  thick,  nriiich  is  embedded  in  a  30  nm  lightly  doped 
region  (GaAs).  In  Hg.  3(a),  the  rc^  part  of  the  density  matrix,  which  is  symmetric  about  the 
diagorul  and  shows  charge  accumulation  on  the  emitter  side,  llie  imaginary  part  is  shown  in 
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Fig.  3(b),  and  is  anti-symmetric,  as  its  derivative  along  the  ncmlocal  diiectioo  yields  the 
current  We  illuretate  die  computed  cuirent-vttitage  relationship  in  Rg.  3(c). 


(a)  (b) 


Figure  3.  The  DBRTD  under  bias,  near  the  vaU^  of  the  I-V  curve.  Parts  (a  and  b)  illustrate 
the  real  and  imaginaiy  parts  of  the  density  matrix,  respectively,  while  the  I-V  curve  itself  is 
shown  in  (c). 
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B.  The  Wigner  Distribution  Function 

The  Wigner  distribution  becomes  important  when  the  physical  problem  is  one  that  is  better 
understood  in  terms  of  a  phase-space  distribution,  and  the  carrier  distribution  function  in  an 
inhomogeneous  device  is  one  such  problem.  This  phase-space  distribution  is  not  easily 
represented  by  the  density  matrix  itself,  but  the  Wigner  distribution  attempts  to  pr^nt  an 
analogy  between  quantum  and  classical  phase  space  for  statistical  mechanics.  Since  the 
statistical  picture  in  phase  space  is  well  u^erstood,  indeed  uses  the  Boltzmann  equation  for 
classical  mechanics,  transfOTining  to  a  similar  picture  in  quantum  statistical  mechanics  allows 
the  physical  picture  of  a  problem  to  be  better  understood.  Unfortunately,  position  and 
momentum  do  not  commute  in  quantum  mechanics,  and  the  two  cannot  be  measured 
simultaneously  to  any  great  accuracy  in  phase  space,  llus  appears  in  the  Wigner  picture  by 
regions  of  the  phase  space  in  which  the  distribution  is  negative  in  value.  When  the 
distribution  is  integrate  over  all  space,  the  probability  density  in  momentum  space  is 
recovered,  and  this  quantity  is  positive  d^inite.  When  the  distribution  is  integrate  over  all 
momentum,  the  probability  density  in  real  space  is  recovered,  and  this  quantity  is  also 
positive  definite.  In  fact,  if  the  Wigner  distribution  function  is  coarse-grain  aveta^  over  a 
region  of  phase  ^ace  corresponding  to  a  six-dimensional  volume  element  whose  size  is  set 
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by  the  unceruunty  principle,  the  result  is  again  a  positive-definite  "averaged”  function.  The 
Wigner  distributicMi  function  is  defined  from  the  density  matrix  through  the  Fourier 
tra^ocm,  often  called  a  Weyl  transform, 

Fw(R.P.r)  «  <P<*  - 1)>  •  (6) 

which  in  a  sense  is  a  Fourier  transform  on  the  variaUe  that  measures  the  distance  from  the 
diagonal  in  die  density  matrix  p(x.  x*).  Since  the  Wigner  distribution  is  a  c-number.  an 
expectation  has  been  indicated  in  (6).  This  transformadon  accentuates  the  correladon  that 
exists  in  the  wave  foncdons  separatnl  in  posution  (if  the  correladon  exists).  The  Wigner 
function  builds  in  the  correlations  between  different  positions  that  are  inhmnt  in  the  off- 
diagonal  elements  of  the  density  matrix.  The  Wigner  function  is  evaluated  at  position  R,  but 
the  density  matrix  terms  that  are  used  in  the  Fourier  transform  are  those  at  the  two  positions 
R  ±  (s/2).  The  wave  function  may  actually  vanish  at  R,  but  the  Wigner  function  will  have  a 
nonzero  value  in  these  areas  in  which  the  wave  function  vanishes,  and  the  values  in  these 
such  regimis  are  measures  of  the  correlation  between  tlM  two  endpoints  on  the  vectm*  s.  The 
equation  of  motion  for  the  Wigner  distribution  function  is  given  by  (again,  in  the  absence  of 
dissipation) 

^  ^  p-VFw  =  ^  Jd3pW(R,p’)Fw(R.P  +  p’.0  .  (7) 

where 

A#(R,p')  =  Jd^s  eV**/*  [sinhC^S'V)  V]  .  (8) 

Equation  (7)  is  quite  similar  to  the  streaming  terms  of  the  Boltzmann  ^nation,  especially  if 
the  lowest  order  term  in  the  expansicm  of  the  pmential  is  used.  The  Wigner  distribution  has 
also  been  used  to  model  the  DBRTD  [13],  and  the  results  are  shown  in  Fig.  4,  again  for  the 
use  of  a  relaxation  time  approximation  for  the  dissipation,  and  for  a  bias  near  the  valley  of  the 
I-V  relation. 


(a)  (b) 

Figure  4.  (a)  Steady-state  Wigner  distribution  at  the  valley  of  the  1-V  curve,  (b)  The  1-V 
curve  fw  a  DBRTD. 


The  Wigner  function  in  this  simulation  shows  a  depletion  region  in  the  cathode  area,  which 
arises  from  a  ctmtact  potential  drcq)  and  the  tendentty  to  form  a  bound  state  in  this  area.  It  is 
largely  eliminated  if  a  lightly-dop^  region  is  introduced  adjacent  to  the  barrier  layers  [13]. 
Such  contact  potential  di^  are  typical  of  most  open  systems,  whether  classical  or  quantum, 
and  are  well-known  in  the  Gunn  effect  device  literature  [14].  The  depletion  was  also  found 
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tt>  go  aw»y  with  greater  amounts  of  scattering  in  detailed  studies  of  die  rede  at  scattering  on 
the  DBRTO  by  Bensley  [15].  Generally,  the  cathode  iMmers”  will  develc^  when  there  is  a 
mismatch  between  the  injection  characteristics  of  the  cadmde  reservmr  and  the  dissipative 
nature  oi  die  active  device  region.  Added  dissipation  or  additional  resistance  in  the  active 
layer  (through  the  lighdy-doped  regions)  reduces  the  mismatch,  and  thereby  reduces  the 
depletion  in  the  catho^-^t^  regitHi. 


C.  Rad-Time  Green's  Functions 

Both  the  density  matrix  and  the  Wigner  distribution  function  are  equal-time  functions,  and 
are  functions  of  only  seven  variables — either  two  vector  postions  and  the  time  or  the  vector 
position,  vector  momentum,  and  the  time,  respectively.  The  energy  does  not  enter  into  either 
description.  This  resulted  from  the  definition  thu  was  used  in  (2).  but  there  was  no  real 
requirement  to  have  defined  it  in  this  manna-.  We  could  as  easily  have  written  the  two  wave 
functions  at  different  times,  and  it  is  possible  to  define  another  Unction,  which  is  a  function 
of  two  times,  through 

G<(x.t.x*.0  =  -  i<^+(x'jy¥(xj)>  .  (9) 

The  equal  time  version  of  (9)  is  obviously  related  to  the  density  matrix  itself.  There  are  in 
fact  a  group  of  real-time  Gian's  functions,  arising  fienn  the  different  ways  in  which  the  wave 
functions  can  be  combined  and  temptvally  order^  Any  simulation  problem  must  solve  for 
the  independent  members  of  this  group  (four  in  number)  [16].  The  particular  Green's 
function  in  (9)  is  the  "less  than"  function,  and  is  closely  related  to  the  density  in  the  otha 
distributions  above.  Introducing  the  change  of  variables  (S),  and  the  equivalent  for  the 
average  time  T,  and  the  difference  time  x,  we  can  introduce  the  enagy  £  (=^)  through  an 
additional  Fouria  transform,  as 

G<(R.p,©,T)  =  i  / erdk-im  <S[^(R  +  |.r + 5)‘F+(R  -|,r - 1)>  d^sdx  .  (10) 

It  is  clear  that  the  x  =  0  limit  of  (10)  will  lead  to  our  Wigna  distribution  function,  and 


Fw(R,p,T)  =  J  do)  G<(R,p,®,r) 


(11) 


Much  more  information  is  contained  within  the  Green's  function  formalism,  since  we  can 
now  investigate  in  detail  the  spectral  density  itself,  which  relates  the  energy  to  the 
momentum.  Howeva,  very  little  has  done  with  these  Green's  functions  in  acturi  device 
modeling.  However,  they  have  been  used  to  study  high-field  transport  in  homogeneous 
systems  H^].  and  simplified  versions  have  been  us^  to  study  the  DBRTD  [17]  for  a  non¬ 
self-consistent  potentiri  and  weak  scattering  from  phonons  (but  introduced  without  resating 
to  a  relaxatioi  time  approximation).  NeverAeless,  the  results  are  suggestive  and  indicate  that 
quite  detailed  quantum  device  modeling  can  be  carried  out  wi^  the  real-time  Green's 
functions. 


IV.  Condusions 

Ova  the  past  few  years,  many  groups  have  begun  to  explore  quantum  methodologies  for 
modeling  real  semiconducta  devices  (at  real  temperatures).  Most  of  tiie  various  approaches 
are  closely  related  to  each  otha,  and  offa  different  ways  of  approaching  any  given  problem. 
While  none  of  the  techniques  htu  become  well  develt^)^  all  of  the  ones  discussed  here  have 
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been  shown  to  lead  to  useful  results  and  have  given  added  insight  into  the  inol^in.  We  are 
now  reusing  the  point  at  which  we  are  trying  to  understand  the  methodok^.  and  are  in  a 
positUM  whm  we  can  now  confidently  use  &  methods  to  study  device  physics.  Even  so, 
many  problems  of  understanding,  particularly  in  the  quantum  statistical  mechanics 
interpreudons  soil  remain,  and  wul  lead  to  many  interesting  lines  at  inquiry  in  the  near 
future. 
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Abstract 

The  purpose  of  this  work  is  to  investigate  the  boundary  conditions  for  the  potential  at 
exposed  semiconductor  surfaces  in  split-gate  structure.  A  two  dimensional  numerical  ap¬ 
proach  is  presented  for  the  coupling  between  the  non-linear  Poisson  equation  in  the  semi¬ 
conductor  (Finite  Element  Method)  and  Laplace’s  equation  in  the  dielectric  (Boundary 
Element  Method).  The  utility  of  the  coupling  algorithm  is  demonstrated  by  simulat¬ 
ing  the  potential  distribution  in  a  n  —  AlGaAsfGaAs  quantum  wire  structure  with  a 
semi-classical  Thomas- Fermi  charge  model. 

I.  Introduction 

Recent  advances  in  nanostructure  fabrication  have  made  it  possible  to  fabricate  struc¬ 
tures  in  which  a  two-dimensional  layer  of  electrons  is  further  confined  into  quantum  wires 
or  dots.  Typically,  such  device  structures  are  defined  by  metallic  split  gates.  In  order  to 
understand  the  potential  distribution  in  those  structures,  we  solve  Poisson’s  equation  in 
the  two-dimensional  problem  domain, 

£VV  =  -P,  (1) 

where  <t>  is  the  electrostatic  potential,  e  is  the  dielectric  constant,  and  p  is  the  charge 
density  [1].  Since  this  is  a  boundary  value  problem,  one  needs  to  know  the  values  of  the 
potentiab  and/or  fluxes  at  the  boundary.  This  is  a  crucial  problem,  especially  at  the 
exposed  semiconductor  surface. 

In  recent  studies  [l]-(7],  the  problem  domain  is  typically  identical  with  the  semi¬ 
conductor  region,  schematically  shown  in  Fig.  1.  The  commonly  used  model  for  the 
boundary  conditions  on  the  exposed  semiconductor  surface  is  either  a  Dirichlet  boundary 
condition  [2], [3]  or  a  Neumann  boundary  condition  (4), (5).  Obviously,  both  of  these  models 
have  their  limitations,  especially  for  very  narrow  split-gate  structures  used  for  quantum 
devices. 

In  this  paper,  we  adopt  an  alternative  viewpoint  and  develop  an  algorithm  to  im¬ 
plement  boundary  conditions  at  exposed  semiconductor  surfaces.  We  view  as  the  natural 
problem  domain  the  semiconductor  and  the  dielectric,  as  shown  in  Fig.  2.  Thus  the  arti¬ 
ficial  boundary  conditions  at  the  exposed  surface  are  replaced  by  more  physical  matching 
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conditkm  at  the  interface  between  the  semiconductor  and  the  dielectric.  Specifically,  this 
algoritnm,  nrferred  to  as  FBEM,  couples  a  Finite  Element  solution  of  Poisson’s  equation 
in  the  semiconductor  to  a  Boundary  Element  solution  [8]  of  Laplace’s  equation  in  the 
dielectric  with  matching  conditions. 


•xpoMd  turface 


SI-GaAs  Subtnie 


200  nmi 


?  H 


^4 

Dielectric 


SI-CaAs  Subeiraie 


Figure  1:  Typical  problem  domain  for  quan¬ 
tum  devices  defined  by  metallic  gates.  Shown 
are  the  multi-layer  semiconductor  regions  and 
the  exposed  surface  between  the  gates. 


Figure  2:  The  problem  domain  consisting  of 
the  semiconductor  regions,  and  and 
the  dielectric  region,  the  semiconductor 
exposed  surface  B-C  is  treated  as  the  interface 
between  the  semiconductor  and  the  dielectric. 


In  Soction  II,  we  present  the  problem  statement  with  the  formulation  of  the  boundary 
conditions  on  the  exposed  surface.  In  Section  III,  we  discuss  our  numerical  problem 
formulation.  In  Section  IV,  we  present  example  results  of  the  FBEM  calculation  for  a 
n  —  AlGaAsfGaAs  quantum  wire  structure  under  bias  conditions,  and  we  compare  them 
to  those  obtained  with  the  usual  Dirichlet  and  Neumann  boundary  conditions. 

II.  Problem  Statement 


We  solve  the  two  dimensional  potential  problem  in  the  domaun  shown  in  Fig.  2, 
where  regions  fli  with  boundary  dCli  and  fi]  with  boundary  dHa  are  semiconductor 
domains  (non-linear  Poisson  equation)  and  region  Qj  with  boundary  dQj  is  the  dielectric 
domain  (Laplace’s  equation).  The  simulation  variable,  u,  is  defined  as  the  potential 
difference  between  the  conduction  band  edge  Ec  and  the  Fermi  energy  Ep  in  units  of 
the  thermal  energy  kT,  i.e.,  u  =  {Ec(<t>)  -  EF)lkT  =  {E^  —  e4>  —  EF)lkT,  where  E^  is 
the  conduction  band  edge  in  the  bulk.  The  boundary  C-B  is  the  exposed  semiconductor 
surface.  Mathematically  C-B  is  the  interface  between  the  region  fl}  and  the  region 
Across  this  interface,  the  potential,  u,  is  continuous  and  the  jump  in  the  normal  electric 
flux  density  is  equal  to  the  interface  charge  density.  The  problem  to  be  solved  can  be 
posed  as  follows: 

«  =  Ui,  U2,  Uj,  (2) 


such  that 


€iV\{x,  y )  =  -f[ui{x,  y )],  (*,?)€  Hi, 

«rfV*Urf(x,  y )  =  0,  (®,  y )  €  fij, 


/=1,2, 


and  with  interface  matching  conditions  at  the  exposed  surface.  dflBC- 

U]  —  =  0,  on  dQcB,  (5) 

=  -^Qint,  on  dttcB,  (6) 

where  /(«<)  —  ft  —  ep{ui)fkT,  is  the  charge  density  term  in  the  domain  (li,  and  Q,nt 
is  the  interface  charge  density  on  the  interface  dflcB-  Generally  Q.nt  and  p  may  be  a 
function  of  the  potential  u,  that  is,  Qi^t  =  Qint{n)  and  p  =  p(u). 

III.  Numerical  Formulation 


The  semiconductor  domain  fl,,  fl,  =  fli  U  02,  where  the  non-linear  Poisson  equa¬ 
tion  (3)  governs,  is  discretized  in  a  way  suitable  for  the  application  of  the  Finite  Element 
Method  (FEM).  The  resultant  non-linear  system  of  equations  is: 

KiiU*  -b  Ki2Ug^  =  Pj, 

d"  ^n^Bc  ~  Pbc'  (^) 

where  u^c  and  pgc  contain  the  potentials  and  nodal  forces,  respectively,  at  the  nodes  on 
the  interface  dOac  between  the  semiconductor  and  the  dielectric,  u'  and  contain  the 
potential  and  nodal  forces  at  all  other  nodes  in  the  semiconductor  domain,  respectively, 
and  K  is  the  stiffness  matrix. 

The  dielectric  domain,  fl^,  is  a  homogeneous  charge  free  region.  The  governing 
equation  is  Laplace’s  equation.  Since  the  fundaunental  solution  of  Laplace’s  equation  (4) 
is  known,  a  boundary  integral  equation  technique  can  be  employed.  The  resultant  system 
of  equations  can  be  expressed  as: 

SllUg  +  S^Ugc;  =  p^, 

S21U0  4-  S22UBC  =  Pbc-  (®) 

where  S'*  is  the  equivalent  stiffness  matrix  and  p**  is  the  equivalent  nodal  force  vector. 

Writing  the  matching  conditions  at  the  exposed  surface,  (5)  and  (6),  in  discretized 
form, 

Ubc  ~  ~  (9) 

Pbc  Pbc  ~ ‘I- 

A  new  global  system  of  equations  is  formed  by  coupling  the  dielectric,  equations  (8), 
with  the  semiconductor,  equations  (7),  and  enforcing  the  matching  conditions  (9)  and 
(10).  vspace-O.lin 


Solution  of  this  set  yields  the  potential  distribution  in  the  semiconductor  domain, 
including  the  interface  dHaci  &nd  the  nodal  flux  on  BTIbc- 


Si,  S,2  0  0  \  /  ui\  /p^\ 

S21  S22  0  I  ubc  _ 

0  Ki2  K,,  0  u*  pj 

0  K22  Kfj  —1/  \Pbc/  V  ®  > 
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IV.  Example 


The  example  structure  with  its  dimensions  is  shown  in  Fig.  2.  The  quantum  wire  is 
realized  at  the  Alo,3GaojAs/GaAs  hetero junction  and  is  defined  by  the  metal  gates  on 
the  top  surface.  The  gate  electrodes  have  an  applied  voltage  Vg.  Between  the  gates  is 
the  exposed  semiconductor  surface  where  the  interface  charge  density  Q,nt  is  assumed  to 
be  ^xed  for  the  example  although  this  is  not  a  limitation  of  the  algorithm.  The  n-type 
doping  density  is  assumed  to  be  10‘*cm“^  in  the  AIq^Goo  tAs  layer  and  in  the 

GaAs  substrate.  A  semi-classical  Thomas-Fermi  charge  model  [2,  7]  is  assumed  in  the 
semiconductor  domain. 


Figure  3:  Sample  result  of  the  potential  land¬ 
scape  in  the  semiconductor  region.  Shown  is 
the  conduction  band  obtained  by  the  FBEM 
algorithm  for  -l.OV  gate  bias.  The  heteroint¬ 
erface  is  clearly  visible  at  the  discontinuity  of 
the  potential. 


Figure  4;  Result  of  the  FBEM  algorithm 
showing  the  condurtion  band  profiles  paral¬ 
lel  to  the  heteroihterface  (on  the  GaAs  side) 
for  different  bias  conditions. 


As  shown  in  the  center  of  Fig.  3,  a  potential  well  is  formed  underneath  the  ex¬ 
posed  surface  for  a  negative  gate  bias  of  —  I.O  V.  In  Fig.  4,  the  conduction  b2md  is 
plotted  parallel  to  the  AlGaAs/GaAs  heterointerface  (on  the  GaAs  side)  for  different 
bias  conditions.  An  electron  gas  is  formed  in  those  regions  where  the  conduction  band 
dips  below  the  semi-classical  electron  quasi  Fermi  level,  which  is  chosen  as  the  zero  of 
the  energy  axis  and  indicated  by  the  dashed  line.  Figure  5  shows  a  comparison  of  the 
results  obtained  with  our  FBEM  algorithm  auid  those  utilizing  the  conventionad  Dirichlet 
or  Neumann  boundary  conditions.  Shown  is  the  conduction  band  profile  parallel  to  the 
AlGaAsfGaAs  heterointerface  (on  the  GaAs  side)  in  Fig.  5(a),  and  the  different  meth¬ 
ods  produce  signihcaintly  different  results.  Specifically,  the  expected  width  of  the  electron 
channel  differs  by  a  factor  of  two  for  the  Dirichlet  and  FBEM  boundary  conditions.  The 
Dirichlet  boundary  conditions  also  predict  a  significantly  higher  electron  density  than  the 
FBEM  algorithm.  Figure  5(b)  shows  the  same  comparison  at  the  semiconductor  surface. 
Both  the  Dirichlet  and  Neumann  boundary  conditions  exhibit  an  unrealistic  discontinu¬ 
ous  behavior  of  the  potential,  as  opposed  to  the  physically  more  appealing  smooth  result 


of  the  FBEM  algorithm. 

In  summery,  we  have  presented  a  study  of  the  boundary  conditions  at  exposed  semi¬ 
conductor  surfaces  and  developed  an  algorithm,  termed  FBEM,  to  solve  this  type  of 
potential  problem  in  the  semiconductor  and  the  dielectric.  The  major  advantage  of  our 
algorithm  is  to  model  the  exposed  semiconductor  surface  by  imposing  the  more  physi¬ 
cal  interface  matching  conditions  without  making  artificial  assumption  about  either  the 
potential  or  the  electric  field  at  the  exposed  surface.  In  ongoing  work,  we  study  the 
implementation  of  different  physical  models  for  the  interface  charge  at  the  exposed 
surface.  These  results  will  be  presented  in  the  future. 


Distance  (nm)  Distance  (nm) 


Figure  5;  Comparison  of  the  conduction  band  profiles  under  -1.5  V  gate  bias  for  the 
three  types  of  boundary  conditions  on  the  exposed  semiconductor  surface;  (a)  parallel 
to  the  heterointerface  (on  the  GaAs  side),  and  (b)  parallel  to  the  semiconductor  surface. 
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Abstract 

Traffic  theoretic  or  queuing  methods  are  proposed  as  a  natural  framework  for  modelling  the 
correlated  tunnelling  and  transport  of  single  electron  solitons  in  coupled  tunnel  capacitor 
structures  below  the  Coulomb  blockade  threshold.  Stationary  state  and  time-dependendent 
modelling  is  developed  and  validated  against  Monte  Carlosimulation.  Exact  results  are  obtained 
for  the  double  tunnel  junction  or  quantum  dot.  The  effects  of  discrete  energy  states  are 
evaluated  and  the  extension  to  multiple  junctions  illustrated. 

I.  Introduction 

The  controlled  transport/correlated-tunnelling  of  single-electronic  excitations  in  coupled 
tunnelling  capacitator  structures  is  now  experimentally  established  in  metal-insulator,  metal- 
semiconductor  systems  and  in  capacitatively-coupled  quantum  point  contacts  in  semiconductor 
2DEG  structures[l-4].  These  structures  rely  on  the  existence  of  ultra-small  capacitative 
structures  such  that  the  effective  charging  energies  e2/2C  exceed  the  thermal  energy  kgT  [5]. 
By  exploiting  state-of-the-art  nanofabrication  it  is  possible  to  construct  20nm  scale  coupled 
capacitors  (metal  on  semiconductor  coupled  Schottl^  <lots)[6]  which  point  the  way  to  a  future 
high  temperature,  high  density  nanoelectronic  systtms  technology  (Fig.l). 


Figure  1. 40  nm  diameter  hemispherical  Alumimum  Schottky  dots  on  p-Silicon  with  IS  nm 
spacing  (Wcavcr[6]). 

Single-electronic,  ultra-small  capacitor  systems  provide  a  new  regime  of  study  for 
semiconductor  device  modelling  wherein  the  effects  of  charging  become  important  and  the 
Coulomb  interaction  provides  subtle  correlation  effects. 
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EL  Monte  Carlo  method 


Simulations  of  arrays  of  single-electronic  tunnel  junctions  (see  Hgure  2)  and  gated  arrays 
(figure  3)  have  been  carried  out  to  date  by  Monte  Carlo  methods  which  have  proved  to  be 
prcriubitive  computationally  for  the  extended  systems  of  technological  interest 
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Figure  2.  A  multi-junction  tunnelling  array.  Figure  3.  Gated  2-junction  array 

The  capacitances  C  are  tunnel  junctions.  is  taken  to  be  non-tunnelling. 

In  the  Monte  Carlo  approach[7,8]  the  state  of  the  tunnel  junction  array  is  described  by  the 
number  of  solitons  (electron  plus  polarisation  field)  at  each  node.  The  total  charge  on  each 
junction/tunnel  junction/ground  capacitance  is  the  sum  of  the  charges  due  to  the  voltage  sources 
and  the  charges  induced  by  the  soliton  structure.These  charges  are  linearly  related  to  the 
voltages  and  number  of  solitons  at  each  node.  The  evolution  of  the  array  is  then  determined  by 
the  stochastic  process  of  single  elecnxjn  tunnelling  which  alters  the  soliton  occupancy  vector  n. 

ni.  Queuing  theory:  stationary  case 

In  the  present  paper  we  introduce  a  new  approach  to  modelling  single-electronic  systems  that 
captures  the  Poisson  stochastic  nature  of  tunnel  events  and  provides  a  fast,  physically 
transparent  and  efficient  method  of  calculating  the  steady-state  charaaeristics  of  multi-junction 
configurations.  This  new  method  involves  a  re-formulation  of  the  transport  equations  in  terms 
of  of  queuing  theory  (traffic  theory [9])  and  centres  on  determining  the  distribution  Pj  of  quasi¬ 
electrostatic  soliton  excitations  that  are  formed  during  the  transport/tunnelling  process.  For  an  N 
junction  array  the  state  of  the  system  is  described  by  Sj  =  (kj,  k2, ...  kiq)  where  kj  are  the 
number  of  excess  electrons(solitons)  at  node  ki(figure  4  shows  states  for  a  2-junction  system). 


XO  XI  X2 


pO  pi  p2 


Figure  4.  Soliton  states  in  a  2-junction. 


Figure  5.  Transitions  of  soliton  states 


An  electron  may  tunnel  from  a  node  to  a  neighbouring  node  provided  a  tunnelling  conection 
exists  (connection  matrix  Xij  =  1).  Electrons  form  a  queue  at  each  arrival  node  where  they  wait 
fOT  "service"  (unlike  conventional  traffic  theory  the  queues  are  always  full  and  the  "service  rate" 


which  is  equal  to  |J.i  the  total  departure  rate  from  each  node  i  varies  as  a  function  of  the  state  of 
the  system).  The  current  passing  between  two  neighbouring  nodes  (l>lt)  which  are  connected  by 

a  tunnel  junction  with  capacitance  C^i  is  calculated  from  the  Pi  by  ;Iu=c.5Li  Pi(ri(k,l)  - 

ri(l,k))  where  Pj  is  the  probability  of  finding  the  system  in  state  Sj,  and  Fi  (k,  1 )  = 
tunnelling  rate  from  node  k  to  neighbouring  node  1  when  the  system  is  at  state  S,.The  average 
voltage  across  the  capacitor  Cy  can  also  be  calculated  as  =  Li?!  •  q^i  ( i )  /Cy^i  ■ 

To  determine  the  distribution  of  the  soliton  states  Pi,  consider  a  single  tunnel  event  from  node 

k  to  node  1  at  state  Sj  to  form  state  Sj.  Define  the  departure  rate  Pij  =  rni(K,X,)  for  Xy  = 
l,fit)m  state  Si  to  Sj  for  the  system  already  occupying  Si .  The  Poisson  nature  of  the  mnnelling 
events  allows  us  to  find  the  total  departure  rate  from  state  Sj  as;|J.j^=  Xj  Itij  and  the 
transition  probability  from  state  S,  to  Sj  is  thusflj  =  Wj  /  M-i  defing  the  routing  matrix  R  .The 
the  average  residence  time  (per  visit)  at  state  Sj  is  l/lii .  The  input  traffic  to  node  i  is  given  by 
=  Z  Xi  rij .  or  in  matrix  form,  the  traffic  erquations  are:  X .  ( I-R)  =  0  where  X  is  the 

row  vector  (X^,  X2,  .  .Xn)  .  Unfortunately,  the  matrix  (I-R)  has  a  zero  determinant  and 
therefore,  the  traffic  equations,  have  an  infinite  number  of  solutions.  To  proceed  let  us 
define  the  occupancy  vector  ni=(mi,m2,...,mn)  where  mpl  if  the  system  is  found  at  state  S, 
and  mi=0  otherwise.  It  is  clear  that  the  elements  of  m  always  satisfy  Xi  =  1 .  The  model 
described  is  similar  to  a  closed  network  of  servers  in  which  there  is  exactly  one  job  travelling 

(trapped)  between  the  service  centres.  Let  X*  be  some  none  zero  solution  of  the  traffic 
equations.  The  probability  distribution  of  m  is  given  as  [12]:  p  ( m  )  = 
tti  (mi)  .a2  (m2)  .  .  .a^ (nin) /G,  where  0^(0)=  l  &  (l)  =Xi*/|ii .G  is  a 

normalisation  constant  evaluated  as  :G  =  X  Hi  cti(mi),  where  the  summation  is  realised 

over  all  possible  vectorsm.  G  reduces  to  G  =  Xi^i*/lti.  Finally,  the  distribution  of 
soliton  structures  is  found  as. 

Pi  -  G"^.Xi*/)ii  which  is  the  key  result  of  this  paper. 

IV.  Exactly  solvable  example:  the  double  junction 

Figure  6  shows  an  application  to  the  2-junction  (or  single  quantum  dot)  compared  with  results 
obtained  by  the  Monte  Carlo  method(750  events). Above  a  threshold  voltage  Vth  electrons  can 
tunnel  into  and  away  from  the  dot,  one  electron  at  a  time.  For  N  excess  electrons  the  total 
electrostatic  energy  due  to  this  charge  is  (Ne)2/2Cr  where  (3r  is  the  total  capacitance  seen  by 
the  charge,  Cp  =Ci+C2+Co  and  Ci,C2  are  the  capacitances  between  the  dot  and  the  metallic 
electrodes  and  Cq  is  the  capacitance  to  the  ground  electrode.  In  this  case,  on  leaving  state  Sj  the 
system  can  only  make  a  transition  to  states  Si+]  or  Si.i(see  frgure  4).  This  is  a  Markovian 

birth-death  process  which  can  be  solved  exactly  using  the  transition  rates  Xn  (Sn-i  ->  Sn)  and 
l^n  (Sn  •>  Sn-i)  given  in  terms  of  the  tunnelling  rates  by;  Xn  =  rn-l(l,c)  +  rn-l(r.c);  Pn 
=rn(c.l)  +  rn(c,r).The  probability  of  finding  the  system  at  state  Sn  is  found  as: 

n 

Pn  =  Z"i.  n  (Xi/Hi)  (n  >  ki)  or  -  Z"1  (n  =  ki) 

i-kl+l 

k2  i 

Z  =1  +  X  n 

i-kj  n-k^+1 
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and  where  and  k2  are  the  minimum  and  maximum  possible  number  of  excess  electrons  that 
can  be  accommodated  on  the  dot.  The  states  contributing  to  the  process  can  be  discovered 

acccmling  to  the  condition:  if  Xj  >  0  and  lit  >  0  then  both  Si  and  Sj.]  are  legal  states. 

The  oscillations  in  the  dc  conductance  with  gate  voltage  at  low  bias  voltage  correspond  to 
transport  through  the  same  number  of  soliton  states.  In  Fig6(a)  transport  in  the  different 
segments  corresponds  to  states  (0,1),(1,2),  (2,3)  and  so  on.  At  higher  bias  moe  states  are 
generated  leading  to  splitting  of  the  segments.  In  Fig  6(b)  the  sequences  are: 
(0,I),(0,1,2),((1,2)  and  (1,2,3).  The  maximum  of  the  conductance  peaks  increases  with 
applied  bias  whereas  the  minima  decrease  in  amplitude.  For  high  bias  V»Vth  ,  the 
conductance  approaches  the  constant  valueG  ->  Gt/2.  The  spe^-up  ovt^r  Monte  Carlo 
simulation  (750  departures)  is  about  300  x  in  CPU  time. 


G  conductance 
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<n> 


(a)  V  =  0.2  e/C 


(b)V  =  0.5e/C 


Figure  6.  Conductances  G/Go  and  soliton  density  <n>  as  a  function  of  voltage  Vg  for  2  tunnel 
junctions  at  bias  voltages  V=0.2, 0.5  e/C.  Squares:Monte  Carlo  .lines:  queuing  theory . 

V.  Time  evolution 

Suppose  the  system  is  found  in  state  S  =  Sfc  at  t  =  0  with  initial  densities  Pi(0)  =  5ik.  At  a  later 
time  t  the  system  will  be  in  a  mixture  of  all  possible  legal  states  with  densities  Pi(t)  given  by  the 

rate  equation:  dPi/dt  =  £  (Pj  |iji}  -  Pi  m .  As  an  application,  consider  a  single  quantum  dot 

described  by  the  2  soliton  state  (S)  =  {  n,  n+I},  with  birth  and  death  coefficients  X  and  p. 
respectively.  If  the  system  is  known  to  be  in  state  Sn  at  t=0,  the  rate  equation  may  be  solved  to 

give:  Pn(t)  =  {fi+X  exp(-(X+n)t }  /(X+n);  Pn+l(t)  =  X  { 1  -  exp(-(X  +  ii)t )  /(X  +  n) 
which  in  the  limit  t  ->  «»o  recovers  our  earlier  result  Pn(~)  =  It  /  (X+n);  Pn+l(«»)  =  X  /  (X+^i). 

VL  General  application  to  multiple  junction  systems  and  quantum  dots 

Very  complex  behaviour  is  possible  with  soliton  propagation  in  multiple  junctions.  Figure  7 
shows  an  example  from  the  traffic  theory  where  we  have  enhanced  the  magnitude  of  the 
conductance  oscillations  in  a  3-junction  array  by  choosing  one  element  to  have  a  large  RCT 
value.  Here  Vr  =  0  and  VI  =  -V.  In  region  (a)  a  maximum  of  one  excess  electron  can  stay  in  the 
system;  the  possible  states  are  (0.0),  (0.1),  (1,0)  and  the  system  stays  mainly  in  state  (0,1).  In 
region  (b)  the  dominany  state  is  (1,1)  and  up  to  2  electrons  excess  occur.  In  region  (c)  1 1  states 
contribute  to  the  conduction  with  betwen  0  and  3  excess  electrons.  Region  (d)  has  12  states 
with  between  -1  and  3  excess  charges.  The  conductance  evidently  peaks  each  time  die  system 
can  accommodate  (xie  more  electron. 
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Figure  7.  I-V  characteristics,  conductance  and  number  of  soliton  states  for  a  3-junction  having 
R3  =  50  Ri,  Ri=R2  ;  Ci=C2=C3=C;  C=C0- 

The  methodology  has  been  extended  to  (a)  quantum  dot  systems  where  the  isolated  charging 

islands  have  discrete  energy  levels[13];  (b)  macroscopic  quantum  tunneelling  processes  [10  ] 

The  theory  is  being  applied  to  the  simulation  of  ultra-sroiall  coupled  Schottl^  dot  structures[6] 

for  which  the  capacitance  matrx  requirs  a  full  3D  Poisson  solver. 
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Abstract 

We  present  here  the  simulation  of  a  Si/SiGe  tnodulatitm-doped  HEMT.  The  electron  transport 
prop^es  were  obtained  from  Monte  Carlo  simulation.  We  use  a  set  of  quantum  hydrodynamic 
equations  for  the  device  simulation.  The  calculated  transconductance  is  about  300  mS/mm  at 
300  K.  Velocity  overshoot  in  the  strained  Si  chaimel  is  observed.  The  inclusion  of  the  quantum 
conectitm  increases  the  total  current  by  as  much  as  15  per  cent 


I.  Introduction 

Modulation  doped  Si|.;iGex/Si/Sii..Gex  offers  a  Si-version  of  the  HEMT.  With  achievable  high 
mobility  in  the  strained  Si  layer  (2000  -  3(X)0  cm2/V«s  at  3(X)  K  [1-21, 1  lt0(X)  cm^/V»s  at  77  K 
[1],  and  175,(XX)  cm^/V-s  at  1.5  K  [3-4]),  the  prospects  of  a  high  performance  FCT  are  good. 
Experimental  devices  have  achieved  transconductance  as  high  as  330  mS/mm  at  300  K  and  600 
mS^]m  at  77  K  in  a  0.25  pm  gt^  device  [1].  In  diis  paper,  we  present  a  numerical  simulation  of 
the  modulation-doped  Side  device. 

In  preparatitm  for  device  simulation,  we  have  obtained  electron  transport  properties,  such 
as  the  velocity-field  and  energy-freld  characteristics  for  strain»i  Si  and  Sii..Ge;(  material,  by 
Mcxite  Carlo  simulation  [2].  The  device  modeled  here  has  a  gate  length  of  0.18  pm  and  we  use  a 
set  of  quantum  hydrodynamic  equations  which  utilize  these  Monte  Carlo  results.  Tte  calculated 
transconductance  is  about  300  mSAnm  at  300  K.  Velocity  overshoot  in  the  strained  Si  channel  is 
observed.  The  inclusion  of  the  quantum  correction  increases  the  total  current  by  as  much  as  IS 
percent. 


II.  Modeling  of  Si/SiGe  niodulation>doped  FET 


A  set  of  hydrodynantic  equations,  which  is  described  below,  is  used  in  the  simulation.  The 
equations,  which  are  essentially  the  same  as  those  used  in  [5-7],  describe  the  particle 
conservation,  momentum  conservation,  and  energy  conservation.  Written  in  the  temperature 
representation,  we  have: 

^  +  V.(nv)  =  0.  (1) 

dr 

^  +  ,.Vv  .  -  ^  V(ntBrq)  -  (2) 


^  +  ^•V(Tn)  *  Vrfvrq) 
37  37  ^ 


dr 


+  2  _  J_y  _  T-Tp 

3%b  "tm  “^w 


(3) 
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whete  n  is  the  average  ekctnHi  density,  v  is  the  average  electron  velocity,  T  is  the  effective 
election  temperature,  m*  is  the  effective  electron  mass.  E  is  the  electric  field,  tm  is  the 
momentum  relaxation  dme,  Xw  is  the  energy  relaxation  dme.  To  is  the  lattice  temperature,  and  7q 
is  given  by 

rq  =  )r+  3^q.  (4) 

with 


and  /  is  the  degeneracy  factor  [7] 

r-  Wter)  -  ■  (6) 

where  Fj  is  the  Feimi-Dirac  integral,  and  i^is  the  Fermi  energy  measured  from  the  conduction 
band  edge.  The  factor  7is  introduced  as  a  coctectitxi  to  tire  total  average  electron  kinetic  energy 
(assuming  a  Fermi-Dirac  distribution  fiincdon); 

w=  |m*v2+  |»Br  +  {/q.  (7) 

Our  Monte  Cario  simulation  was  carried  out  to  obtain  the  transport  properties  of  the  SiGe 
materials.  The  computed  velocity-field  and  energy-field  relations  are  plott^  in  Figs.  1  and  2  for 
strained  Si  grown  on  relaxed  Sio7G^3.  with  naodulaticre-doped  concentrations  of  1.5  x  IQl^ 
cm*3  and  1.0  x  10^^  cm*^  at  300  tL  lire  velocity  curves  in  Fig.  1  show  that,  due  to  the  higher 
mobility  of  the  electrcxis  in  the  strairred  Si.  elections  have  higher  velocity  than  that  in  SiGe 
material.  The  low-field  mobility  is  found  to  be  about  3000  cm2/V*s,  a^  a  slightly  larger 
saturation  velocity  is  observed.  Tte  average  election  energy  (Fig.  2)  for  strairred  Si  rises  faster  at 
low  field  because  of  the  li^t  transverse  nutss  and  redireed  intervalley  scattering,  but  is  less  at 
high  field  due  to  impact  kmizatioiL 


10*  10*  10^ 


Electric  field  (V/cm) 

Figure  1.  Velocity  as  a  function  of  field  fm  strairred  Si  on  relaxed  SiGe,  found  from  Monte 
Cario  simulation. 

The  relaxation  times  Xm  and  Xw,  sritich  are  functxms  of  energy,  are  determined  by  fitting 
the  homogeneous  hydrodyruumc  equations  to  the  velocity-field  and  energy-field  relations  in  Fig. 
1  and  Fig.  2. 
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Fig.  2  Avera^  energy  as  a  function  of  field  for  strained  Si  on  relaxed  SiGe,  found  from  Monte 
C^o  simulation. 

The  numerical  simulation  has  been  applied  to  a  0.18  pm  ^ue,  quantum-well  device  with  a 
modulaticm-doped  structure  of  Sip  7Gep3/Si^io.7GeQ3.  The  device  structure  is  shown  in  Fig.  3. 
The  (^ing  of  the  top  Sio  TGeg^  layer  is  taken  to  be  3.5  x  10^^  cm*^,  and  a  dt^wg  of  1.0  x  10^^ 
cm*3  is  us^  in  the  substrate  Sto  ^Geo j.  The  lattice  ten:q;)erature  in  the  simuladmi  is  taken  to  be 
300  K.  The  typical  simulation ‘dom&  is  1.0  pm  x  0.095  pm.  The  thickness  of  the  top  SiGe 
layer  is  19  nm,  and  the  strained  Si  channel  is  18  nm.  Tire  graded  interface  transition  at  the 
SiGe/Si  junctitHi  results  in  an  effective  3  run  spacer  layer.  Ftre  simplici^,  we  c«ly  use  a  three- 
layer  structure.  The  modulaticm-doped  structure  in  [1]  is  more  complicated.  However,  the 
active  structures  are  similar. 


Gate 

Source  (~0.l8pm)  Drain 


Fig.  3  The  device  structure. 


nL  The  simulation  results 

The  conqnitBd  I- V  characteristics  for  tire  0. 1 8  pm  gate  device  are  shown  in  Hg.  4,  for  gate  biases 
of  0.7, 0.5, 0.2,  and  0  volts,  respectively.  The  stw  threkness  of  the  top  Si^  layer  provides  a 
normally-off  device,  since  a  Schotticy  barrier  hei^t  of  0.9  V  leads  to  an  estimated  depletion 
width  of  18.4  niiL  The  peak  transcondoctance  is  wout  300  mSArnn,  and  good  saturation  with  a 
drain  amductance  of  4.6  mSAnm  at  tire  gate  vttitage  of  0.5  V  is  obtaiired.  Approximately  tire 
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same  current  level  and  transcoiuluctance  was  found  in  a  0.25  device.  The  simulation  results 
are  cooqMrable  to  the  experimented  results  in  [1].  The  relatively  larger  cunent  level  (0.3  mAT^) 
and  transconductance  p30  mS/mm)  found  in  the  experiment  is  ^ught  to  be  due  to  a  higher 
sheet  density  (2.S  x  10^^  cm'^  [1]  compared  to  1  x  10*^  cm*^  in  this  simuladcm)  in  tlw  quantum 
well  for  their  particular  modulation-doped  structure. 


Fig.  5  Election  density  across  the  channel. 

Our  simulation  shows  that,  without  including  quantum  corrections,  the  cinrent  would  be 
15  per  cent  smaller  for  the  0.18  pm  gate  device  at  a  ^te  voltage  of  0.5  V,  as  shown  in  Fig.  4.  In 
other  words,  the  inclusion  of  the  quantum  potential  increases  the  total  current  by  as  much  as  15 
per  cent  in  the  simulation.  This  If^  modification  of  the  current  was  not  expected  in  the  device 
with  such  a  gate  lengdi.  However,  by  inspecting  the  density  distribution  along  the  channel,  one 
can  find  that  a  rapid  density  change  occurs  at  die  gate  end  close  to  the  drain  contact  within  a 
region  much  shcaW  than  the  gate  length.  In  light  of  the  quantum  correction  depending  on  the 
density  change,  the  modification  of  the  current  by  the  quantum  effects  is  understandable,  since 
the  elecmm  ^nsity  is  high  aral  the  density  change  occurs  in  a  short  distance.  As  we  expected, 
similar  doisity  distributum  across  the  conduction  channel  (to  what  we  found  in  the  GaAs/AlGaAs 
HEMT)  is  found  in  diis  device  [7].  The  electron  densities  with  and  without  quantum  potential 
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included  are  plooed  in  Fig.  5,  which  shows  the  increase  of  the  election  density  in  the  channel 
when  quantum  potential  is  included. 

Velocity  overshoot,  with  peak  velocity  2.6  x  10^  cm/s,  was  observed  in  die  strained  Si 
channel  for  both  gate  lengths  us^  in  this  simulation,  and  is  very  important  in  achieving  the 
transconductance  observ^  In  Fig.  6,  we  plot  the  longitudinal  velocity  along  the  conduction 
channel  in  the  quantum  well.  The  bias  ccmdition  in  this  case  is  V_  s  0.5  V  and  =  1.5  V.  The 
velocity  overshoot  in  the  gate  region  results  in  a  peak  velocity  of  2.6  x  10^  cm/s.  The  overshoot 
is  important  in  achieving  the  high  transconductance  for  the  device,  for  it  introduces  larger  current 
flow  along  the  quantum  well.  The  first  velocity  peak  in  the  plot  is  due  to  the  model  structure  we 
used  for  the  change  of  interface  discontinuity  [7],  although  it  is  not  practical,  it  does  sug^st  that 
the  structure  can  increase  the  electnm  velocity  between  source  and  gate,  which  in  turn  will  raise 
the  average  velocity  through  the  device  and  enhance  the  device  performance. 


Distance  from  source  to  drain  (run) 


Fig.  6  Longitudinal  velocity  in  the  quantum  well 
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Abstract 

A  parallelised  Chebyschev  accelerated  over-relaxation  algorithm  is  developed  for  simulating  the 
few  electron  problem  inhigh  magnetic  fields.  A  novel  form  of  resonant  tunnelling  is  proposed  to 
occur  through  a  single  potential  barrier  due  to  the  confining  potential  provided  to  one  electron 
by  a  following  electron  in  quasi-one  dimensional  current  flow. 

I.  Introduction 

Arrays  of  coupled  ultra-small  capacitor  stmctures  exhibit  significant  charging  effects  due  to  the 
large  charging  energies  deriving  from  the  small  values  of  the  components  of  the  capacitance 
matrix.  In  particular,  the  occurrence  of  single-electronic  effects  such  as  the  Coulomb  blockade, 
correlated  single-electron  tunnelling,  and  the  demonstration  of  switchable  single  flows,  has 
raised  hopes  of  a  future  single-electronics  technology.  TTie  “orthodox  moder‘[l]  of  correlated 
single-electron  tunnelling  has  been  highly  successful  for  the  relatively  large  metal-insulator 
systems;  it  assumes  continuous  energy  distributions,  large  numbers  of  states  in  the  ‘metallic’ 
electrodes  and  simplified  pictures  of  tunnelling  and  electron  correlation.  However,  many  tenets 
of  the  onhodox  theory  are  lost  in  the  2DEG  semiconductor  systems  and  in  the  recently 
proposed  coupled  Schottky  dot  structures[2].  Rather  than  a  single  electron  picture  we  must  de^ 
with  a  few-body  problem  in  which  only  distant  dense  ‘electrodes'  might  be  treated  by  self- 
consistent  mean  field  approaches. 

Previously[3,4,5]  we  have  developed  a  vectorised  algorithm  for  the  numerical  modelling  of  2D 
quantum  transport  through  quantum  point  contact  structures,  quantum  waveguides  and 
Aharonov  Bohm  ring  devices  using  the  ADI  algorthm  on  the  discretised  2D  Schrodinger 
equation.  This  method  fails  for  high  magnetic  fields  due  to  errors  arising  from  non¬ 
commutativity  of  the  split  kinetic  energy  operators.  It  is  also  unacceptable  in  accuracy  for  the 
few  body  problem  and  regions  of  complex  quantum  chaos. 

II.  Algorithms 

This  work  is  part  of  a  much  more  general  problem  which  concerns  the  modelling  of  the 
transport  of  n-interacting  electrons  within  an  arbitrary  shaped  quantum  waveguide  (hence 
discrete  energy  states)  including  charging  ^ects.  Our  algorithm  has  been  structined  such  that 
the  interacting  2  electron  problem  can  be  studied  for  a  ID  spatial  model.  The  2  election  problem 
in  configuration  space  is  isomorphic  to  the  problem  of  1  election  in  2D  space  in  the  presence  of 
an  effective  2D  potential  given  by  the  sum  of  the  confinement  potentials  for  each  electron  plus 
the  Coulomb  potential.  Consider  first  the  1  election  2D  time-dependent  Schrodinger  equation 

iti^  y/(x,y,t)  =  Hyf(x,y,t) 
at 

which  has  solutions  of  the  form 


(1) 


h  J 


V'(x.y.O=  v^(ac.y.O)exp|-i^ 

or 

Y(x,y,t  +  Ar)  =  ^(jc,y,r)exp|-^^| 


Using  the  Cayley  Expansion  for  the  exponential  and  defining  t  = 


W(x,y)'"=  Y(x,y)' 


1-//T 


iAt 

2h 


l  +  Ht 

Where  the  t  indices  refer  to  discrete  time. 


(2) 

(3) 

(4) 


Now  for  a  single  election  two-dimensional  system  with  time-independent  potential  profile  the 
time-independent  Schrddinger  Equation  is  given  by 


V(x,y)\y/(x,y) 


(5) 


Using  Taylor  Expansions  this  can  be  approximated  to  second  order  in  A  (the  spatial  grid 
spacing)  by  ft, 

Where  once  again  the  x  and  y  indices  refer  to  the  discrete  space.  Substituting  this  in  Eqn  (4) 
above,  re-ordering  and  defining  various  other  terms  then  gives 

+  ¥‘.ty  +  ^Xy  +  KXx  +  <l-X  = 

where 

^*-y  =  “  4  -  ^ 

^_4wA* 

P^irrr  (8b) 


(7) 


y= 


hAi 

2mA* 


Ky  =  (4+  +iP)¥'..y  -  ¥Uy  -  ¥Uy  "  Ky*X  "  ¥'..y-x 


(8c) 

(8d) 


^e  ^gOTthm  we  use  to  solve  this  set  of  equations  is  Chebyshev  Accelerated  Simultaneous 
Over  Relaxation,  a  succesive  tqiproximation  method.  The  residual  is  defined  as 

C  *  C..,K.,  +  (9) 

whOT  the  i  ii^x  lefeis  to  each  iteration.  This  is  iterated  tepealediy  in  a  chessboard  fashion  fi  e 
odd/even)  usmg  ^ '  ' 


=  y/  - 

r  x,y  Y  x,y  - 


(10) 


until  the  residual  is  sufficently  small.  The  initial  wave  fimetion  for  each  set  of  iterations  is  taken 
as  that  for  die  previous  time  step.  ©  is  defined  by 
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0)*=1 


0>‘  = 


1 


(l-iPy) 


(11) 


6)“**  = 


(l-ip>") 


where  Pj  is  the  spectral  radius  of  the  Jacobian  method  i.e. 
Pj  =i(cos(5J)  +  cos(%)) 

and  ]  and  L  are  the  number  of  grid  points  in  each  direction. 

IV.  Inclusion  of  a  magnetic  field 


(12) 


The  basics  of  this  algorithm  remain  the  same  when  magnetic  field  is  included  and  when  it  is 
adapted  for  two  ID  interacting  elcctrons.Using  the  symmetric  gauge  Eqns  (7)  &  (8)  become 

c,,  yf‘:i  +  (1  -  m  +  (1  +  m  +  d  -  iex)v[%  +  (1  +  m  (7a) 

where 


hAt 
2mA^ 

o;.,  ^(4+  )4',,  +iP+«>(4"+y))v‘.^ 

-(1  +  i9y)  -  (1  -  iOx)  -  (1  +  i&x) 

0  =  meBA* 

0  =  - 


(8a) 

(8b) 

(8c) 


in 

The  residual  is  then  defined  by 

C  =  Ky  +  (1  “  +  (1  +  iOy)  Kt'.,  +  (1  -  iOx) 

and  used  as  before  to  solve  for  each  succesive  approximation. 

V.  Two  Electron  case 


(8d) 

(8e) 

(80 

(9a) 


If  we  consider  x  and  y  as  the  ID  coordinates  of  two  interacting  electrons  the  time 
independent  Schrbdinger  equation  can  be  written  identically  to  Eqn  (5)  above  with 

V(x,y)  =  -— ^ - j  +  V,(x)  +  V.(y)  (13) 

Aj[eo\x-y\ 

where  V,  is  the  ID  electrostatic  confinement  potential.Therefore  using  this  as  the  potential 
enables  us  to  use  identical  methods  as  above. 

VI.  Parallelisation 

The  2  dimensional  grid  is  split  evenly  over  a  rectangular  array  of  transputers  which  iterate 
their  own  section  of  grid  and  swap  boundary  conditions  at  each  time  step. 
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Figure  (1)  transputer  linkage  (4x4  case) 

The  method  is  parallelised  on  a  64  transputer  array  and  runs  at  approximately  200  MFLOPS. 
The  results  of  this  study  are  also  being  used  to  validate  a  novel  algorithm  for  few  electron 
transport  in  2  dimensions  based  upon  an  extension  of  the  coupled-mode  formalism  [8] 
developed  for  1  electron  2D  tranport.  Applications  to  chaos  in  magnetic  fields  are  in  progress. 

VII.  Coulomb  assisted  resonant  tunnelling  in  quasi-lD  electron  systems. 

For  the  case  of  2-interacting  electrons  transporting/tunneling  through  bottlenecks  in  small  arrays 
of  quantum  point  contacts  we  have  identified  a  new  resonant  tunnelling  process  which  arises 
from  the  tunnelling  decay  from  the  quasi-confinement  of  one  electron  between  a  second 
following  electron  and  a  downstream  potential  barrier.  The  effect  requires  mono-mode 
conduction.  Hitherto,  only  the  combination  of  conventional  resonant  tunnelling  with  Coulomb 
blockade  from  the  charging  of  the  intermediate  quantum  well  has  been  identified.  Figure  (2) 
shows  the  case  of  two  equal  spin  electrons  initially  30  nm  apart  (Coulomb  energy  4.5  meV) 
encountering  a  barrier  of  height  50  meV  ,  width  5  nm  for  incident  kinetic  energy  of  10 
meV/electron  for  electrons  in  GaAs.  The  effect  is  strongly  damped  unless  the  electron  stream  is 
strongly  correlated  in  space  and  time.  Coupled  quantum  dots,  long  chain  polymers,  enzymes, 
redox  chains  provide  possible  test  cases. 
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Abstract 

Single-electronic  systems  are  described  by  coupled  sets  of  circuit  equations  which  link  the 
charge,  voltage  and  current  distributions  of  tunnelling  and  non-tunnellmg  capacitor  arrays.The 
concept  of  critical  charge  is  used  to  implement  an  efficient  network  solver.  System  simulation 
may  then  be  achieved  by  Monte  Carlo  methods  although  this  is  often  prohibitive 
computationally  .Alternatively,  linear  programme  techniques  can  establish  the  boundaries  for 
stable  operation.  The  full  3D  modelling  of  the  capacitance  matrix  is  required  for  recently 
develop^  Schottky  dot  structures. 

I.  Introduction 

The  recent  development  of  single-electronic  devices  has  exploited  the  phenomenon  of 
correlated  single  electron  tunnelling[l]  in  coupled  tunnel  junctions(ultra-small  capacitor  arrays) 
under  conditions  set  by  the  Coulomb  blockade  threshold  e2/2C  »  kT.  At  least  three  different 
classes  of  structure  are  currently  under  study  experimentally:  vertical  metal-insulator-metal[2- 
4],  lateral  metal  semiconductor  metal[5] ;  and  laterally  patterned  two-dimensional  electron  gases 
in  semiconductor  hcterostructures[6].  These  new  devices  pose  new  challlenges  for 
computational  electronics[7]:  they  are  strongly  capacitively  coupled,  the  major  transport 
mechanism  is  correlated  tunnelling,  the  presence  of  thermal  fluctuations,  cross-talk,  charge 
trapping  de-trapping  and  macroscopic  quantum  tuimelling  are  all  potentially  killer  effects.  Most 
significantly  existing  single-electronic  systems  may  be  quite  large  (up  to  100  devices)  requiring 
new  systems  tools  for  design  and  analysis. 

II.  Network  solver 

We  have  developed  a  set  of  simulation  tools  based  on  a  similar  formalism  to  Bakhvalov  et 
al[10]  for  a  linear  array  of  tunnel  junctions  but  extended  to  arbitrary  single  electron  tunnel 
junction  circuit  configurations  using  a  matrix  representation  of  the  various  voltages,  currents 
and  circuit  elements.  The  general  theory  derives  from  an  analysis  of  a  basic  tunnelling  event 
and  a  single  tunnel  junction  (capacitance  C)  in  series  with  a  non-tunnelling  capactance  and  a 

voltage  source  Vq.  The  tunnelling  rate  F  is  determined  by  the  temperature  T  and  the  free  energy 

change  following  the  tunnel  event[l]:  r(AE,T)  =  (AE/e2Rt)[exp(AE/kBT)-l  )’l  where  Rt  is  the 
juncticHi  tunnelresistance.  At  low  temperatures 

r(AE)=(-AE/e2Rt)(AE<0);r(AE)  =  0(AE<0).  (1) 

For  the  simple  circuit  we  have:  E  =  Qc^/(2Ce)  +  Qc^/(2C)  -t-  (JyVo;  where  Qv  is  the  charge 
through  the  voltage  source  and  the  critical  charge  Qc  =  e/(2[l+Ce/C]).  For  a  single 
electrcmtunnel  evoit  we  find 

AE  =  -eQ/C  +  e2/(2(C-*<:e))  (2) 

This  “instantaneous”  model  for  tunnelling  provides  a  first  level  for  our  system  simulation  tools. 
A  second  level  utilises  the  quantum  Langevin  equation[  ]  approach  handle  fluctuations  (due  to 
lack  of  space  we  do  not  describe  that  here).  The  critical  charge  is  the  charge  at  which  tunnelling 
becomes  advantageous;  the  critical  charges  for  a  circuit  depend  solely  on  the  junction 
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capacitance  and  the  lumped  capacitance  Ceff  of  the  remaining  circuit.  We  have  developed  a 
network  solver  for  Ceff  by  deploying  an  impedance  matrix  Z,  positive  definite  with  rank  equal 
to  the  number  of  loops  in  the  circuit.  The  diagonal  elements  are  the  total  impedances  around 
each  mesh;  the  off-diagonal  terms  are  the  total  impedances  shared  by  two  loops.  By 
partitioning  this  matrix  we  can  separate  out  loops  of  no  interest  to  obtain  a  lower  rank  effective 
in:q)edance  matrix  Zeff.  Hence  we  may  determine  the  critical  charges. 

III.  Monte  Carlo  and  Linear  Programming  simulators 

The  Monte  Carlo  simulator  uses  the  general  network  solverto  find  the  critical  charges  for  each 
device  fex*  which  tunnelling  becomes  feasible  in  the  tunnel  junction  network.  The  simulator  then 
iterates  “events”  by  repeatedly  characterising  the  circuit  for  a  given  set  of  input  voltages  and 
clocked  charge  positions,  discovering  which  tunnelling  event  will  occur  next  and  updating 
dependent  and  independent  circuit  parameters  using  charge  conservation  and  the  circuit  matrix 
equations.  Although  this  approach  is  important  it  is  computationally  expensive  especially  for 
tlw  larger  extended  systems  of  coupled  devices. 

To  obtain  a  more  rapid  asses.<(ment  of  the  possible  stable  operating  regimes  of  single  electronic 
devices  and  systems  we  have  developed  a  linear  programming  technique  which  allows  us  to 
determine  the  allowed  regime  of  stable  operation  in  the  control  parameter  space.  The  method  is 
essentially  an  inverse  of  the  Monte  Carlo  approach:  the  allowed  or  disallowed  tunnel  events  are 
defined  &st  followed  by  a  detrmination  of  the  circuit  voltages  and  charge  values.  The  approach 
is  illustrated  in  figures  2-4  for  a  turnstile  device[4]  shown  schematically  in  figure  1.  The  Monte 
Carlo  results  for  the  operational  area  of  Vg-Va  space  (fig  4)  are  given  by  accumulating  legal 
points.  The  area  so-defined  is  found  to  be  well-modelled  by  boundary  lines  determined  fi'om 
linear  programming  using  the  tunnel  event  schematics  of  figure  3. 


These  new  simulation  tools  have  been  used  to  study  effects  of  cross-talk  and  inter  array 
coupling  in  single-electronic  systems.  Figure  6  shows  typical  results  of  the  effects  of  stray 
capacitance  couplin  Cstray  current  through  two  pa^el  tunnel  junction  arrays  shown  in 
figures. 
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IV.  Schottky  dot  systems 


The  traditional  approach  to  single-electronic  devices  has  utilised  the  hanging-resist  technology 
developed  for  inetal-insulator-metal  tunnel  junctions[2].  Although  this  method  has  allowed 
construcdon  of  large  numbers  of  interconnect^  junctions  in  ccxnplex  circuits  it  is  limited  by  the 
lithography  to  relatively  large  capacitances  and  consequently  very  low  operating  temperatures. 
Very  recently  a  much  finer  resolution  lithography  which  has  fewer  processing  stages  and 
involves  forming  ultra-fme  (  <  S  nm  radii  and  spacings)  metal  on  semiconductor  electrodes. 
Schottky  islands  and  dot  arrays  has  been  develop^  at  Glasgow[S](Fig  7).These  new  structures 
involve  ultra-small  capacitances  with  equivalent  Coulomb  blockade  temperatures  in  excess  of 
60K  scaleable  to  much  greater  than  room  temperature.  The  dot  structures  may  be  arranged 
laterally  to  form  pass  transistors  (figS),  RAM  cells  (fig9)  or  more  complex  circuits.  An 
essential  feature  of  these  systems  is  the  requirement  for  the  “tunnelling  tails”  of  the  Schottky 
islands  to  overlap  thus  permitting  correlated  electron  tunnelling  from  dot  to  dot  via  the 
semiconductor.  Modelling  of  these  systems  is  crucial  in  order  to  determine  the  effective 
capacitance  matrix  and  for  understanding  how  to  control  effects  of  unwanted  traps  (charge- 
trapping  de-trapping  effects)  which  can  destroy  single  electronic  stability.by  structuring  the 
substrate,  ground  plwes,  deling  levels  and  island  geometry. 

The  develpment  of  stable  and  reliable  single  electronic  systems  requires  the  precise  design  of 
both  the  junction  capacitances  (inter-capacitances)  and  the  capacitances  to  ground.  Since  the 
measurement  of  such  ultra-small  capacitances  is  very  difficult  experimentally  the  design 
becomes  reliant  on  the  numerical  simulation  of  the  capacitance  matrices.  The  problem  is 
essentially  a  3D  problem  involving  complicated  geometries,  several  dielectric  regimes,  and 
device  physics  which  involves  surface  conditions,  random  distributions  of  traps  and  their 
dynamics. 

To  illustrate  part  of  the  design  problem  and  the  importance  of  an  adequate  numerical  solution 
we  present  here  some  results  of  2D  simulation  of  Aluminium  wires  on  the  surface  of  p-sUicon 
as  an  approximation  to  the  Schottky  dot  devices  shown  in  Figures  7  9.  The  simulations  were 
performed  by  the  simulator  H2F  [8].  Figure  10  shows  the  potential  distribution  around  two 
40  nm  width  wires  with  12  nm  spacing.  The  fringing  effects  and  presence  of  the  silicon 
substrate  increased  the  junction  ciq)acitance  4  times  in  comparison  with  the  sinqjle  parallel  plate 
formula.  The  ground  capacitance  is  more  than  twrice  as  high  as  the  junction  capacitance.  The 
presence  of  surface  piiming  states  modifies  the  picture.  The  i^sence  of  donor  type  states  near 
the  middle  of  the  bandjgap  increaes  the  ground  capacitance  slightly,  but  a  nx>te  profound  effect 
is  produced  by  modelling  acceptor  states  which  significantly  increae  the  capacitancc(25%).Thc 
juction  and  ^ound  capacitances  may  be  tuned  by  anisotropic  etching  leaving  metal  islands  on 
the  top  of  silicon  pedestals  (figure  11  ).  It  is  found  that  200nm  etching  reduces  the  junction 
capacitance  by  a  factor  of  3x  and  the  capacitance  to  ground  is  diminish^  by  a  factor  of  2x.  A 
parallelised  3D  simulator  is  under  development  for  for  more  realistic  prediction  and  design  of 
such  devices[9]. 
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Figure  10.  Potential  distributitm  around  two  Figure  11.  Potential  distribution  around  two 
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Abatract 

We  demonstrate  a  useful  decon^sition  of  die  po^dal  energy,  which  accelerates  the  relaxation 
method  for  Ending  the  ground  state  of  a  Schrbdinger  operator  in  multiple  space  dimensions, 
and  improves  the  ultimate  accuracy  achievable.  The  potential  decomposition  makes  the 
SchrOdmger  operator  approximately  separable  for  the  evolving  approximation  to  the  ground 
state;  this  decreases  the  error  associate  with  the  long  time-steps  in  alternating-difference 
in^licit  schemes. 


L  Review  and  Motivation 

A  central  step  in  many  quantum  modeling  problems  is  to  find  the  lowest-energy  eigenstates  of  a 
Hamiltonian  operator.  A  standard  tqiproach  to  find  them  is  the  relaxation  technique  [1]. 

The  relaxation  technique,  applied  to  End  the  ground  state  of  Hamiltonian  H,  amounts  to 
evolution  of  the  Schrddinger  equation  in  imaginary  tiine: 

H9(/)=-»aKP(/),  (1) 

Starting  from  an  initial  condition  that  has  nonzero  overlap  with  the  ground  state,  the  ground 
state  is  asytiqitotically  dominant  after  imaginaiy  times  long  compared  to  tio liE\~EQ), 

<P(0  ~  ao(/)  90  •  (2) 

(£b  and  £i  ate  the  ground  and  Erst  excited  energies  of  the  Hamiltonian,  respectively;  oq  is  a 
space-independent  coefEcient)  The  primary  goals  in  such  an  evolution  are  stability  and 
accuracy.  InqrUcit  techniques  exist  which  are  well-known  to  yield  stable  evolution  both  in  real 
and  inu^inary  time,  in  which  accuracy  can  always  be  assured  by  using  a  sufEciently  smudl  time 
step  4/.  The  simplest  of  diese  is 

(  1  +  AtHin  )  tpi  =  9»-^  (3) 

with  (pi  ■  <p(r,);  tj-  '}At. 


However,  just  as  not  all  eigenstates  ate  required,  similarly  not  every  kind  of  accuracy  is 
required  either.  In  particular,  since  the  object  of  the  relaxation  is  simply  to  remove  the  high- 
energy  components  as  nqiidly  as  possible,  in  order  to  examine  what  is  leE,  an  accurately 
eiqKmential  decay  of  diose  cotrqKments  is  less  important  than  their  mere  rt^id  disappearance. 

Fot  this  reason,  it  is  common  to  accelerate  the  relaxation  procedure  by  using  long  time  steps. 
When  this  acceleration  is  used  with  one-dimensional  Hamiltonians,  one  converges  to  die 
ground  state. 

The  evolution  described  in  (3),  like  other  implicit  evolution  methods,  requites  inversion  of  a 
large  matrix  (in  this  case,  the  matrix  rqnesenting  \-¥  At  WH  ).  The  matrix  to  be  inverted  has  a 
dimension  equal  to  the  number  mesh  ^ints.  This  dimension  grows  as  the  inverse  of  the  mesh 
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q^ing  nised  to  the  power  of  the  mesh  dimension.  For  multidimensional  problems,  this 
inversian  is  impncticaf  or  numerically  intractable.  Instead,  standard  time-evolution  techniques 
for  muld-dimeasional  Hamiltonians  typkaUy  use  an  operator-separation  or  alternating-direction 
implicit  (ADI)  scheme.  For  example,  deconqjosing  a  two-dimensional  Hamiltonian  H^  as 


H2D-H*  +  Hy, 

(4) 

one  may  write 

m  <pi“i , 

l+drHV»  l+dfHy/» 

(5) 

oreqoivalendy: 

(l  +  at  lV>in  )  » «p»-i . 

(6a) 

(l  +  At  HV»  )  . 

(6b) 

If  H^  and  H^  involve  only  the  kinetic  energy  of  motion  along  x  and  y  directions  respectively, 
then  each  step  (6a,  6b)  requires  die  solution  only  of  a  tridiagonal  matrix. 


In  ADI  schemes,  short  time  steps  are  necessary  not  only  for  the  accurate  exponential  decay  of 
high-energy  states,  Imt  also  for  the  accurate  estimation  of  the  ground  state.  For  exanqile,  in 
(S),  p  tqiproaches  the  ground  state  of 

H'»H2D  +  H*Hy-,  (7) 

h 

which  by  perturbation  theory  implies  a  ground  state  energy  that  also  is  accurate  only  to  first 
order  in  dr.  Asa  result,  while  short  time  steps  may  be  used  initially  to  accelerate  the  di^pation 
of  high-energy  components,  eventually  short  time  steps  must  be  used  to  achieve  accurate 
ground  states.  In  this  kind  of  variable  time-step  relaxation,  it  can  be  difficult  to  determine 
unambiguously  how  well  one  is  converging.  (See  however,  work  by  Doss  and  Miller  on  ADI 
solution  of  Laplace  and  related  equations  [2],  in  which  a  way  is  found  to  optimize  the  time 
step.)  In  generid,  efficient  use  of  tlus  approach  can  require  a  certain  amount  of  user  interaction, 
and  is  corre^ndingly  difficult  to  pro^am  for  systematic  application. 

SqiaraUe  Hamiltonians  constitute  an  important  exception  to  the  above  distinction  between  one- 
diimnsional  and  higher-dimensional.  That  is,  if  an  ADI  scheme  consists  of  the  alternating 
ippUcation  of  the  separated  conponents  of  a  separable  Hamiltonian,  dien  relaxation  converges 
to  the  ppropriate  ground  state  even  for  large  time  steps.  [This  can  be  seen  immediately  in  0), 
from  dw  fact  that  die  ground  state  is  simultaneously  an  eigenstate  of  H’^  and  H^  as  well  as 
H^.]  Most  Hamiltonians  (almost  all,  in  the  appropriate  probabili^-measure  sense)  cannot  be 
put  into  separable  form.  However,  as  we  describe  below,  it  is  possible  to  find  a  decomposition 
^at  is  sepuable  in  a  restricted  sense;  the  Hamiltonian  is  partitioned  in  such  a  way  that  at  least 
the  groiuid  state  is  simultaneously  an  eigenstate  of  each  partitioned  component  This  permits  an 
acceleration  of  relaxation  without  die  usual  penalty  in  accuracy. 


IL  Adaptive  Potential 

Most  Hamiltonians  (almost  all,  in  the  ^^priate  probability-measure  sense)  cannot  be  put  into 
sqiarable  ftmrt  However,  as  we  describe  below,  it  is  possible  to  find  a  decomposition  that  is 
sparable  in  a  restricted  sense,  thus  permitting  an  acceleration  of  relaxation  without  the  usual 
poialty  in  accuracy. 


In  Older  to  simplify  the  prescatation,  vn  here  consukr  only  two-dimensional  Hamiltonians  (4). 
We  seek  new  polnitials  Vq(xo')  and  V^(xo')  so  that,  with 

h;.K*  +  vJ  .  Hj-Ky  +  Vj,  (8) 


the  ground  state  po  of  also  satisfies  the  auxiliary  equations 

(9) 

If  po  is  known,  (9)  can  be  solved  for  the  potentials  using  (8).  Because  the  total  potential  must 
always  be  V,  it  ^fices  to  determine  only  the  single  function 

AV«|(v*-Vy).  (10) 

During  the  time  evolution,  one  knows  only  p.  the  evolving  best  approximation  to  po-  To 
indicate  this  we  omit  the  zero  subscripts  on  the  separated  potentials.  Assuming  that  p  -  Po 
small,  we  “solve”  (9)  to  find 


(11) 


In  the  adaptive-potential  ADI  (APADI)  iqiproach,  we  recompute  AV  after  each  time  step.  Using 
this  AV  directly  to  define  the  current  potential  is  unstable,  so  instead  we  relax  exponentially 
toward  the  instantaneous  potential,  with  AV  for  the  ith  time  step  defined  by: 


AV*  s  (l-<i)AV‘-l  +  a  AV[p*-l] , 
and  depending  on  a  relaxation  parameter  a. 


(12) 


III.  Numerical  Results 

We  have  iqiplied  the  potential  (12)  in  both  first-order  and  second-order  operator  separation 
schemes.  We  first  treat  a  separable  example  (a  sum  of  finite  square-well  potentials,  in  the  x  and 
y  directions)  in  order  to  compare  the  adaptive-potential  ADI  (APADI)  scheme  not  only  with 
naive-potential  ADI  (NPADI:  V*  =  V^  =  V/2 )  but  also  with  the  a  partition  using  the  exact 
separa^  potentials  ADI  (SPADI).  Figure  1  compares  these  in  a  Hrst-order  scheme.  We  plot 
the  fractional  deviation  of  the  computed  ground  state  energy  from  the  exact  ground-state 
energy,  using  a  constant  time  step  (At  -  2,  in  units  chosen  so  Aat  the  electron  mass,  ftngstrbm, 
and  electron-volt  have  unit  magnitude),  with  a  one-dimensional  well  depth  of  0.2  eV,  a  well- 
width  of  40  A,  and  electron  mass  of  0.1  x  free-electron  mass.  The  relaxation  parameter  a  is 
0.05.  APADI  clearly  represents  a  significant  improvement  over  the  standard  approach, 
represented  by  NPADL 

After  t  ■•200,  SPADI  fluctuates  due  to  round-off  eirtv.  Using  second-order  operator  separation 
(“true”  ADp,  the  APADI  scheme  is  significantly  better,  to  the  extent  that  APADI  and  SPADI 
differ  only  in  this  round-off  error  regime. 
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Figure  1.  Fractional  error  in  the  ground  state  energy,  found  using  first-order  operator 
separation  for  an  exaedy  separable  probleta 

A  nonseparable  example,  illustrated  below,  is  based  on  the  total  potendal  illustrated  in  figure  2 
(piecewise  constant  0.0  eV  in  first  and  third  quadrants,  0.2  eV  in  the  remaining  two,  with  hard 
walls  around  a  square  of  side  4(X)A). 


Figure  2.  A  nonseparable  potential,  with  the  x  axis  along  the  right-hand  edge,  and  the  y  axis 
atong  the  kdt-hand  edge. 
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After  relaxing  toward  a  solution,  V*  had  the  form  shown  in  Fig.  3.  Note  that  V*  is  different 
for  each  energy  leveL  The  special  case  in  which  it  is  the  same  for  every  level  is  the  usual  case 
of  a  separable  poientiaL 


Figure  3.  V*  for  ground  state  of  V  in  figure  2  (different  scale,  perspective:  x-axis  still  to  right) 

The  graph  corresponding  to  Fig.  1,  for  the  nonseparable  potential,  is  Fig.  4.  This  calculation 
was  done  using  a  second-order  ADI. 


Figure  4.  Fractional  error  for  nonseparable  potential 
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Abstract 

We  present  a  numerical  study  of  the  chemical  potential  and  of  the  capacitance  in  a  model 
quantum  dot.  Our  model  includes  the  electron-electron  interaction  and  exchange  and 
correlation  effects  within  the  framework  of  density  functional  theory.  Our  results  exhibit 
the  t3q)ical  features  observed  in  experiments,  such  as  the  increase  in  the  capacitance  for 
increasing  number  of  electrons  and  the  presence  of  irregularities  in  the  succession  of  the 
chemical  potential  values  vs.  the  electron  number. 

I.  Introduction 

Recent  experimental  results  have  shown  extremely  interesting  single-electron  phenom¬ 
ena  in  semiconductor  quantum  dots.  The  conductance  across  a  quantum  dot  loosely 
coupled  to  the  external  electrodes  via  low-transparency  tunnel  barriers  has  been  shown 
to  be  periodic  with  respect  to  the  voltage  applied  to  suitably  positioned  gates  [1-3]. 
This  effect  has  been  explained  as  the  consequence  of  the  chemical  potential  in  the  dot 
lining  up  with  the  one  in  the  external  electrodes  [2].  If  the  electrostatic  energy  prevails 
over  the  quantum  confinement  energj',  the  behavior  of  the  dot  is  substantially  capac¬ 
itive  and  we  expect  evenly  spaced  values  of  the  chemical  potential  versus  the  number 
of  electrons.  Several  theoretical  studies  of  quantiun  dots  exist  in  the  literatme  [4-5], 
dealing  with  both  very  ideahzed  models  and  with  reedistic,  three-dimensioned  models. 
Our  aim  has  been  to  solve  for  the  chemical  potential  in  a  structure  which,  even  though 
simplified,  retains  all  the  characteristic  features  observed  in  the  experiments.  We  study 
a  2-D  model  quantum  dot  defined  by  a  given  confining  potential.  Except  for  this  sim¬ 
plification,  i.e.  the  confining  potential  not  determined  from  the  actual  characteristics 
of  the  semiconductor  layers  and  the  geometries  and  voltages  of  the  metal  gates,  we  try 
to  t2ike  into  account  all  relevant  contributions,  including  many-body  effects,  within  the 
framework  of  density  functional  theory. 

II.  Quantum  dot  model 

Our  model  quantum  dot  is  two-dimensionad  and  defined  by  the  shape  of  the  confining 
potential,  represented  by  hard  walls  along  the  perimeter  of  a  square  region  to  which 
we  shall  refer  in  the  following  as  “quantum  box” .  The  potential  within  the  box  is  not 
constant,  but  quasi-paurabolic,  as  the  one  produced  by  a  positive  backgroimd  charge  of 
lOOg  (<jr  being  the  electron  charge),  imiformly  distributed  over  the  surface  of  the  box. 
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The  potential  effectively  seen  by  the  electrons  in  the  calculations  presented  in  this  paper 
is  substantially  the  quasi-parabolic  part,  since  this  already  provides  enough  confinement 
as  to  make  the  electron  density  vanish  before  reaching  the  hard  walls.  This  is  a  re2ilistic 
approximation  of  the  actual  potential  in  the  experimental  situations. 

Our  study  is  performed  at  the  temperature  of  0  K,  thus  for  a  system  in  the  ground  state 
We  solve  for  the  eigenfunctions  and  the  eigenenergies  of  the  quantum  dot  by  means  of 
an  iterative,  self-consistent  procedure  [8]  based  on  the  Kohn-Sham  density  functional 
approach  [9]. 

The  Schrodinger  equation  is  discretized  with  a  standard  five-point  formula  and  the 
eigenvalues  and  eigenvectors  of  the  matrix  thus  obtained  are  computed  by  means  of  a 
Ritz  iterative  procedure. 


III.  Numerical  results 

The  chemiced  potential  is  the  main  quantity  we  want  to  evaluate.  Once  the  wave  func¬ 
tions  and  the  energy  eigenvalues  have  been  determined,  there  are  several  ways  to  obtain 
the  chemical  potential  /i(N)  {N  being  the  number  of  electrons).  We  can  compute  the 
total  energy  E(N)  of  the  system  from  [9] 


E{N) 


— — —dr  dp 


i=i  ^  J  J 

+  j n{f)  [Eei{n{f))  +  E  corr  (n(r))  -  VexiTi{f))  -  ^  corr  (n(f))]rfr,  (1) 


where  e,  are  the  energy  eigenvalues  for  each  electron,  n(r)  is  the  total  electron  density, 
EexiEcorr,Vex^ycorr  are  the  exchange  and  correlation  energies  and  potentials,  respec¬ 
tively. 

From  the  definition  of  chemical  potential  we  have  that  p{N)  =  E{N)  -  E{N  —  1).  The 
result  of  this  differentiation,  however,  may  be  adversely  affected  by  numerical  errors  in 
the  values  of  E{N)  and  E{N  —  1).  A  plot  of  p  versus  N  obtained  with  this  procedure  for 
a  60  X  40  nm  box  is  shown  in  Fig.  1  with  solid  dots.  In  the  same  figure  we  report,  with 
empty  squeires,  the  chemical  potential  computed  for  the  same  structure  with  a  different 
method:  Slater’s  approximation.  The  “removal  energy”,  i.e.  the  energy  needed  to 
remove  one  electron  from  a  system  of  interacting  electrons  can  be  approximated  [10] 
by  £{N  -t-0.5),  the  energy  eigenvalue  for  a  fictitious  additional  particle  with  charge  q/2. 
It  is  apparent  that  for  a  small  number  of  electrons  both  methods  yield  the  same  result 
while,  for  more  than  10  electrons,  the  technique  based  on  the  difference  between  the 
total  energies  for  N  and  —  1  electrons  starts  being  severely  affected  by  numerical 
noise.  We  have  therefore  used  Slater’s  approximation  in  most  of  our  calculations. 

In  Fig.  2  w'e  report  an  idealized  representation  of  the  conductance  peaks  which  would  be 
measmred  between  two  leads  very  loosely  coupled  through  the  a  square  quantum  box  of 
various  sizes.  The  height  of  the  peaks  is  purely  conventional  and  there  is  no  broadening 
because  of  the  assumption  of  0  K  temperature  and  of  vanishingly  small  transpjirency  for 
the  tunnel  barriers.  The  three  plots  correspond  to  three  different  box  sizes:  40  x  40  nm, 
80  X  80  nm  and  200  x  200  nm. 
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Fig.  1  Chemical  potential  vs.  number 
of  electrons 
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Fig.  2  Conductance  peaks  for  a  square 
quantum  box 


The  quantity  on  the  abscissae  axis  is  the  dot  potential  scaled  multiplying  it  by  the  length 
of  the  box  side  in  nanometers.  This  scaling  normahzes  the  three  plots  with  respect  to 
the  energy  associated  with  the  Coulomb  interaction  [1].  The  first  peak,  coincident  with 
the  zero  reference  for  the  dot  potential,  is  for  three  electrons. 

The  spacing  between  conductance  peaks  is  very  uneven  for  the  smallest  box  and  a 
more  careful  exam  of  the  plot  shows  that  there  are  groups  of  evenly  spaced  peaks 
corresponding  to  the  various  degenerate  energy  levels  in  the  square  box.  For  example, 
the  peaks  for  the  3rd,  4th,  5th  and  6th  electrons  correspond  to  to  the  2nd  and  3rd 
single-electron  orbitals,  which  are  degenerate  in  a  square  box,  in  the  absence  of  electron- 
electron  interaction. 


n*30 


Fig.  3  Electron  density  in  a  square  quantum  box. 

As  the  size  of  the  dot  increases,  we  observe  a  reduction  of  the  relative  importance  of 
the  quantization  energy  and  the  clear  prevalence  of  the  Coulomb  energy,  leading  to  an 
almost  even  spacing  for  the  200  x  200  nm  box.  The  plot  for  this  largest  box  also  shows 
an  effect  which  has  been  experimentally  observed:  the  reduction  of  the  average  spacing 
between  peaks  with  increasing  number  of  electrons.  This  effect  is  mainly  due  to  the 
increase  of  the  area  of  significant  electron  density  when  more  electrons  are  added  to  the 


dot  and  see  a  less  attractive  potential,  because  of  the  screening  performed  by  the  ones 
in  the  lower  orbitals. 

In  Fig.  3  contour  plots  of  the  electron  density  are  shown  for  a  number  of  4,  10,  15,  30 
electrons  confined  in  a  120  x  120  nm  quantum  box.  There  is  a  very  significant  change 
in  the  extension  of  the  non-zero  electron  density  and  in  its  shape  when  the  number  of 
electrons  is  increased.  This  accounts  for  the  observed  crowding  of  conductance  peaks. 
A  differential  capacitance  Cd  can  be  associated  with  a  quantum  dot,  according  to  the 
following  definition: 


Cd{N)  = 


n{N+\)-n{Ny 


(2) 


This  quantity  is  more  readily  imderstandable  if  we  consider  a  dot  in  the  proximity  of 
some  conducting  surface  such  as,  for  example,  a  metal  gate.  This  is  also  the  most 
common  configuration  in  the  devices  which  have  been  experimentally  investigated.  The 
differential  capacitance  represents  the  ratio  of  the  electron  charge  to  the  variation  of  the 
voltage  between  the  gate  and  the  dot  when  an  electron  is  added  to  the  system.  If  the 
gate  is  much  larger  than  the  dot,  it  can  be  approximated  with  an  infinite  conducting 
surface  amd  modelled  with  properly  placed  image  charges. 


Fig.  4  Capacitance  vs.  length  for  4  electrons. 

The  results  of  our  calculations  of  the  capacitance  [8]  in  the  presence  of  metal  gates  are 
shown  in  Fig.  4.  The  four  curves  are  for  4  electrons  in  a  rectangulaur  quantum  box  with 
a  length/width  ratio  of  4/3  and  a  metal  gate  at  a  distance  d  of  5,  15,  60  nm  or  no  gate 
at  all.  We  see  that,  while  for  no  gate  and  for  a  distant  gate  the  capacitance  grows  in  a 
substantially  linear  fashion  as  we  would  expect  for  a  2-D  geometry  isolated  in  3-D  space 
(see  e.g.  [2]),  for  a  very  close  gate  the  behavior  becomes  almost  queidratic,  reaching  the 
well-known  classical  limit  of  the  parallel  plate  capacitor.  For  a  small  gate-dot  separation 
we  perform  simply  a  Hartree  calculation,  because  the  expressions  used  for  the  exchange 
and  correlation  potentials  [8]  do  not  hold  in  the  presence  of  strong  interaction  with  the 
image  charges. 


rv.  Conclusion 

We  have  studied  a  model  two-dimensioned  quantum  dot  with  a  quasi-parabolic  confining 
potential,  including  many-body  effects  within  a  density  functional  approach.  Resvdts  for 
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the  chemical  potential  in  a  square  box  have  been  presented,  showing  the  transition  from 
a  behavior  dominated  by  quantum  effects  to  one  in  which  the  Coulomb  charging  energy 
is  predominant.  The  differential  capacitance  which  can  be  thus  defined  approximates 
the  one  of  a  conducting  2-D  surface  with  a  shape  like  the  one  of  the  area  of  significant 
electron  density  in  the  quantum  dot. 
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Abstract 

We  present  numerical  methods  for  a  rather  new  model  to  calculate  quantum  steady  states 
of  electrons  in  thermal  equilibrium.  The  well  known  Bloch  equation  is  generalized  to  ar¬ 
bitrary  distribution  functions  (Fermi-Dirac)  <ind  appropriately  coupled  to  the  Poisson 
equation.  A  rational  approximation  of  the  distribution  function  yields  an  iterative  algo¬ 
rithm. 

I  Introduction 

We  present  and  discuss  algorithms  to  calculate  quantum  steady  states  of  thermally  equi¬ 
librated  electrons  in  a  semiconductor  device.  We  consider  a  one-electron  approximation 
with  a  self-consistent  potential  describing  the  Coulomb  interaction  of  the  electrons  with 
each  other  and  with  the  background  of  (positive)  ions. 

The  most  evident  model  is  the  Schrodinger-Poisson  approach  which  has  been  widely 
used  (see  e.g  [1],  [2],  [3],  [4],  [5]).  However,  the  solution  of  the  eigenvalue  problem  in  an 
iterative  loop  is  numerically  rather  expensive,  such  that  it  might  be  interesting  to  look 
for  alternatives. 

It  is  well  known  that  under  the  assumption  of  Boltzmann  statistics  the  steady  state 
density  matrix  can  be  obtained  from  the  so  called  Bloch  equation  (see  e.g  [6],  [7],  [8]). 
The  Bloch  equation  is  a  parabolic  equation  in  which  the  role  of  the  time  is  played  by 
the  reciprocal  of  the  temperature.  The  spatial  operator  is  the  negative  Hamiltonian  and 
the  initial  datum  is  a  delta-distribution.  The  charge  density  is  obtained  by  evaluating 
the  density  matrix  at  the  normalized  diagonal.  The  appropriate  coupling  to  the  Poisson 
equation  leads  to  a  non-st2uidard  evolution  problem  ([8],  [9]). 

We  present  the  generalization  of  the  linear  Bloch  equation  to  a  self-consistent  model  for 
arbitrary  occupation  probabilities.  A  rational  approximation  of  the  distribution  function 
yields  an  iteration  in  the  temperature  variable,  the  position  variable  can  be  discretized 
e.g.  by  spectral  methods.  Due  to  the  specific  coupling  to  the  Poisson  equation  Bloch-type 
equations  give  the  steady  state  density  matrix  only  for  one,  fixed  temperature. 

II  Modeling 

In  the  usual  approach  the  state  of  the  system  is  described  by  one-particle  wave  functions 
in  some  L^-space  (e.g.  [1],  [2],  [3],  [4],  [5]).  One  has  to  calculate  the  eigenfunctions  of  the 
stationary  Schrodinger  equation 

(11.1)  HV’,(r)  =  Eixl^i[r),  =  1,  i  e  No 

The  Hamilton  operator  reads 

(11.2) 
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with  some  local  potential  V(r)  taking  into  account  the  direct  Coulomb  interaction  of  the 
electrons.  Additional  terms  like  an  applied  external  potential,  heterojunction  discontinu¬ 
ities,  exchange-correlation  terms  etc.  can  be  included  without  changing  the  algorithms. 
The  (electron)  density  is  given  by 

(II.3)  n(r)  =  Yi  fiE,  - 

»€A/o 

where  /(F,  —  F,  /3)  is  the  probability  of  finding  an  electron  in  the  state  V’i.  Thermodynam¬ 
ics  enter  via  the  reciprocal  temperature  /3  =  1/fcT,  where  k  is  the  Boltzmann  constant 
and  T  the  temperature. 

In  order  to  achieve  self-consistency  we  calculate  V{r)  as  a  function  of  n(r)  by  simply 
solving  the  Poisson  equation.  Note  that  this  direct  coupling  yields  a  /?-dependence  of  all 
quantities. 

For  electrons  we  should  use  Fermi  -  Dirac  (FD)  statistics  ; 

(II-4)  fpoiEi  —  F,^)  =  ^0^Ei-F)  ^  j 

which  are  approximated  by  Boltzmann  statistics  for  high  temperatures  (small  /?) 

(11.5)  fsiEi  -F,/3)  = 

The  “Fermi  energy  F”  and  Z  are  (/^-dependent)  normalization  constants  : 

(11.6)  J  n{x)dx  =  1 

As  an  alternative  to  the  Schrodinger-Poisson  equation  the  (charge)  density  n(x)  for  a 
given  temperature  /3  can  be  expressed  using  what  we  call  “Bloch-ansatz”  : 

(11.7)  n(x)  ^Tr  f{H-F-  id) 

Here  we  mean  “trace”  in  the  following  sense  ; 

(11.8)  n(x)  =  p(r  =  x,s  =  x,  /3*)  =  p{x,  x) 

where  />(r,  s,  j3*)  is  the  (normalised)  density  matrix,  i.e.  the  integral  kernel  of  the  density 
operator  f{H  —  F  ■  id,P’).  We  have 

(11.9)  p(r,  s,  /?*)  =  f{H-F-  id,  nHr  -  s) 
or 

(11.10)  p(r,  s,  =  Y  f(^}  -  /?*)V’>(r)V’;(s) 

i=i 

which  shows  the  equivalence  of  the  Bloch-ansatz  to  the  Schrodinger  approach. 

For  the  Boltzmann  distribution  (II.7)  yields  the  “clMsical”  Bloch  equation  [6]  as  an 
evolution  equation  (in  /?)  for  the  unnormalized  density  matrix. 


where  Hr  means  acting  on  the  r  varibable  only. 


(11.12) 


p{r,3,/3  =  0)  =  ^(r-  s) 


(11.13) 


n(r) 


p{r,r,0‘) 

i  p{y,y,P'’)dy' 


/3*  fixed 


We  had  to  fix  fi*  since  the  coupling  to  the  Poisson  equation  requires  attention  :  A  direct 
coupling  for  temperature  dependent  n(i,  /3)  ;=  p(i,  i,  /3)  yields  a  /3-dependent  Haunil- 
tonian  and  the  Bloch  equation  is  not  valid.  Only  p(r,  s,  ^*)  has  a  physical  meaning  for 
the  selfconsistent  Bloch  equation  ([8],  [9]). 

For  other  than  the  Boltzmann  distribution  no  such  parabolic  evolution  equations  can 
be  derived.  However,  we  can  still  derive  iterations  for  the  density  matrix  withou.  solving 
the  eigenvalue  problem  (II. 1),  (11-2). 


I 

III  Numerical  methods 


For  the  Boltzmann  statistics  case  the  following  relaxed  iteration  for  the  potential  has  been 
proven  to  be  convergent  in  [8]  :  For  given  14  the  Bloch  equation  (II. 11),  (11.12)  is  solved 
for  /3  €  (0,/3*],  the  density  (11.13)  is  evaluated  for  fixed  (3  —  and  the  Poisson  equation 
yields  a  potential  14+ 1  which  gives  the  new  potential 

(III.14)  (x)  =  (1  -  u>)14  (x)  -b  u>i4+i  (x),  <  1 

For  Fermi-Dirac  statistics  it  is  possible  to  use  an  analogous  iteration  which  avoids  the 
explicit  calculation  of  the  Fermi-level  [10]  : 


(111.15) 

(111.16) 

(111.17) 


Pk  (r,  s,  /3) 

D 

n\pk]  (x)  =  p{x,  X,  /?*) 

A14[pfc](x)  =  -n[pt](x) 
Hk\pk\  =  —A  +  Vk\pk] 
(1 


(111.18) 

(111.19) 

(111.20) 


=  i9e(0,/3*] 

p(/?  =  0)  =  S(r  -  s)  -  pk{r,  s,  =  0) 

(I 

Pk+ly^  —  .  » .  j 

/  p(y,  y,  0)dy 


Like  in  the  Boltzmann  case  the  iteration  has  to  be  appropriately  underrelaxed  to  as¬ 
sure  convergence.  Underrelaxed  iterations  have  been  used  for  the  Schrodinger  -Poisson 
equation  ([4],  [5]). 
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However,  the  numerical  results  of  Nier  [4]  show  that  the  optimal  underrelaxation 
parameter  u;  depends  heavily  on  the  Debye  length  Aj  of  the  considered  device  and  that 
w  has  to  be  extremely  small  for  realistic  values  of  A^.  On  the  other  hand,  the  iteration 
(III.15)  -  (III.20)  applies  only  for  F-D  statistics  as  given  by  (114),  which  looks  different 
for  the  2-D  case  (see  e.g.  [1]). 

The  method  we  propose  is  based  on  a  rational  (Pade)-approximation  of  an  arbitrary 
distribution  function  /(/?(!?,  —  F))  : 

(111.21)  f{z)  ^  hz) = n 

t=i 

To  understand  the  idea  we  take  the  Boltzmann  case  /(  ''  =  and  solve  the  Bloch 
equation  (11.11),  (1112)  with  an  implicit  Euler-scheme  : 

(111.22)  {I  +  A0Hr)pt+i  =  Pt,  k  =  0,...,N-l,  N-A0  =  0- 

(111. 23)  po  =  S(r  —  s) 

Hence  we  see  that  the  numerical  iteration  corresponds  to  the  approximation 

(111.24)  f(z)  =  e-o-'  ^  (1  +  ^z)-"  =  nz)  =  n -fw 


In  general  we  have 

N 

(III.25)  -  F))  =  n(^  -Fid  +  Cf  id)-\atH  -  atF  ■  id^bi) 

/=i 

where  0*  is  contained  in  the  coefEzients  a<,  c/  which  have  to  be  determined  once  from 
the  (Fade)  -  approximation. 

Hence  we  use  the  following  iteration  for  the  potential  V  and  the  Fermi-level  F: 


(III.26) 

V*(x),  F*  (Ffc_,) 

Hk  =  -A  +  Vk 

(III.27) 

{aiHk  -  aiFk  ■  id  +  ht)(Hk  -  Fk  •  id  +  ct  •  id)~ht+i 

(III.28) 

zo  =  S{r  —  s) 

(III.29) 

PklVk,  Ffc](r,  s,  0”)  :=  ZN.i{r,s) 

a- 

(III.30) 

nk[Vk,  Fit]  (i)  =  pk{x,  x,0‘) 

a- 

(III.31) 

Ck[Vk,  Fk]  :=  J  nk[H,  Ffc](x)di 

i 


I 


i 


i 


i 
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(III.32) 


Vfc+i  (x)  solving  -  AVk+i  (x)  =  CklYk 

(111.33)  Ffc+i  is  given  implicitly  by  the  normalization  (II.6) 

A  secant  scheme  using  Ck  and  also  Ft_j,  Ck-i  gives  : 

(111.34)  F*+,  =  Fk-  (Ck  - 

Note  that  in  general  the  iteration  (III.26)-(III.33)  does  not  correspond  to  any  differ¬ 
ential  equation  like  the  Bloch  equation  for  Boltzmann  statistics. 

The  above  iteration  has  been  tested  for  a  1-D  model.  The  discretization  in  x  has  been 
done  both  with  finite  differences  on  an  equidistant  grid  and  with  a  spectral  collocation 
method  using  trigonometric  basic  functions.  Homogenous  Dirichlet  boundary  conditions 
have  been  used. 

First  results  have  shown  a  good  convergence  with  a  moderate  number  (below  10)  of 
iterates.  Indeed,  the  Pade-approximation  with  =  6  is  basically  indistinguishable  from 
the  exact  FD-distribution.  The  implementation  of  a  2-D  code  using  the  spectral  method 
is  ongoing. 
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Abstract 

A  review  of  the  simulation  of  electronic  transport  at  and  over  quantum  weU  heterostruc¬ 
tures  is  given.  A  large  number  of  experimental  results  relating  to  thermionic  emission, 
phonon-bottlenecks,  electron-electron  interaction,  tunneling,  as  well  as  spontaneous  and 
stimulated  emission  will  be  reviewed  in  the  presentation,  while  this  short  note  will  con¬ 
centrate  on  effects  pertinent  to  quantum  well  lasm. 

I.  Introduction 

Transport  over  a  quantum  well  and  recombination  of  dectrons  (holes)  in  the  well  form, 
in  principle,  a  formidable  problem.  Obviously  the  transport  in  the  immediate  vicinity  of 
the  well  can  only  be  treated  by  quantum  methods,  while  in  the  remainder  of  the  structure 
(laser)  the  transport  is  well  described  in  a  semiclassical  way  using  drift-diffusion  theory  and 
extended  versions  of  it.[l,  2]  To  bring  classical  and  quantum  transport  regions  together 
requires  sophisticated  theories  such  as  the  Bardeen  TVansfer  Hamiltonian  approach[3] 
or  the  Landauer-Buttiker  theory(4,  5]  extended  to  include  inelastic  processes.  Under 
ordinary  circumstances  a  rate  equation  approach  and  rates  such  as  those  used  by  Brum 
and  Bastard[6]  may  be  sufficient.  However,  subtleties  with  the  Pauli  Priciple  and  the 
ranges  and  normalizations  of  classical  and  quantum  regions  present  problems.  [7] 

In  the  presentation  related  to  this  paper,  a  number  of  cases  of  generation-recombination 
and  transport  near  quantum  wells  is  going  to  be  discussed.  In  this  paper,  however,  we 
discuss  only  a  few  facts  relevant  to  the  development  of  complete  tools  for  semiconductor 
laser  simulation.  It  is  in  the  modulation  response  of  lasers  that  the  subtleties  of  quantum 
well  transport  most  obviously  manifest  themselves.  The  model  we  propose  is  not  to  be 
viewed  as  a  final  version  but  as  a  starting  point  for  a  complete  numerical  approach. 

n.  The  Multiscale  Problem 

As  mentioned,  transport  in  heterostructures  such  as  a  quantum  well  laser  poses  a  difficult 
problem.  The  quantum  well  must  be  treated  appropriately  because  it  is  the  region  of  pos¬ 
itive  gain,  and,  therefore,  it  is  the  most  critical  determinate  of  optical  output.  However, 
the  refpmts  surrounding  the  quantum  wdl  also  play  important  roles  in  the  device  char¬ 
acteristics.  For  example,  the  current  blocking  regions  significantly  influence  the  effective 
quantum  efficiency  of  the  device.  Also,  the  transport  in  the  separate  confinement  tenons 
can  lead  to  low  frequency  ioll-<^  in  the  modulation  response[8]  and  gain  saturation.  [9, 10] 
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Figure  1:  Triangular  tesselation  of  a  quantum  well  device  is  inappropriate  due  to  a  small 
aspect  ratio.  A  combination  of  rectangles  and  triangles  produces  a  mesh  consistent  with 
the  quasi-one-dimensional  nature  of  quantum  wells. 

As  a  result,  the  simulation  of  such  structures  is  also  a  multiscale  problem. 

The  first  difficulty  with  such  a  multiscrle  problem  involves  the  tesselation  of  the  device.  It 
is  standard  practice  to  use  triangles  to  discretize  a  complicated  device  structure  because 
they  conform  well  to  nonrectjmgular  features,  such  as  contoured  current  blocking  regions. 
However,  in  the  case  of  quantum  well  structures,  triangles  have  a  serious  drawback.  A 
well  formed  triangle  usually  has  an  aspect  ratio  on  the  order  of  one.  If  a  ridge  waveguide 
laser  structure  with  a  10  fim  wide  and  100  A  thick  quantum  well  is  being  simulated,  then 
the  use  of  triangles  leads  to  an  explosion  in  the  number  of  mesh  points.  Rectangles,  on 
the  other  hand,  lend  themselves  very  well  to  the  quasi-one-dimensional  nature  of  quan¬ 
tum  wells.  As  a  result,  the  use  of  rectangles  and  triangles  produces  the  most  convenient 
tesselation  for  a  quantum  well  device.  Figure  1  shows  two  meshes  generated  for  a  simple 
quantum  well  structure.  The  more  flexible  aspect  ratio  of  the  rectangles  produces  a  much 
more  appropriate  mesh. 

m.  Coupling  the  Classical  and  Quantum  Regions 

In  addition  to  optimizing  the  number  of  mesh  points,  the  use  of  rectangles  in  and  near  the 
quantum  well  is  crudal  to  the  treatment  of  transport  in  these  regions.  Rectangles  allow 
the  quantum  well  to  be  divided  up  into  transverse  cross-sections.  The  mesh  points  in  each 
cross-section  can  then  be  used  to  solve  a  one-dimensional  Schrodinger’s  liquation,  giving 
the  eigenenergies  and  wavefunctions  for  the  bound  states  in  the  well.  IVansport  in  and 
around  the  well  can  then  be  treated  in  a  special  way.  Figure  2  is  a  schematic  represention 


Figtire  2:  The  schematic  diagram  shows  a  transverse  cross-section  of  the  quantum  well. 
The  classical  and  quantum  mechanisms  that  determine  the  carrier  distributions  are  la¬ 
belled  where  they  apply.  Note  that  the  circles  represents  mesh  points  in  the  discretization, 
not  charge  carriers. 


of  this  treatment.  The  figure  shows  the  conduction  band  edge  and  the  ground  state 
wavefunction  for  one  of  the  transverse  cross-sections  of  the  well.  Drift-diffusion  theory 
describes  the  transport  up  to  the  quantum  well.  The  particle  fluxes  are  expressed  in  the 
following  form. 

J«  =  -/inWc:Fo(f7„)(rVl7„  +  VJSe) 

Jp  —  ~'I*P^v^o{Vp){T^TIp  —  V£v) 


where, 


Tin  =  {Fn  -  Ec)/T,r,p  =  (Ev  -  Fp)/T 


Fi 


1  /«>  ,  x' 

jlJo 


In  each  of  the  flux  equations,  the  first  term  in  parentheses  is  the  diffusion  term  and 
the  second  is  the  drift  term,  where  the  gradient  in  the  band  edge  determines  the  field. 
Thermionic  emission  theory  then  determines  the  flux  of  carriers  into  or  out  of  the  well 
according  to  the  following  expression. 


i  =  4;T* 


2mim2 
mi  -l-mj 


|exp( 


Epni  ~  Ec2  \ 
kT  ) 


-  exp  ( 


Epnl  —  Ec\  —  Eb 
kT 


)} 


where,  =  the  Richardson  constant. 

Inside  the  quantum  well,  classical  transport  is  not  valid  in  the  transverse  direction;  it 
is  the  form  of  the  wavefunctions  that  determines  the  distribution  of  carriers  inside  the 
well.  When  carriers  are  injected  into  the  well,  they  conform  to  the  distribution  estab¬ 
lished  by  the  wavefunctions  almost  instantaneously.  Classical  transport  is  eliminated  by 
requiring  that  the  quasi-Fermi  levels  inside  the  well  are  constant  in  the  transverse  direc¬ 
tion.  Their  absolute  positions  are  determined  by  the  carrier  fluxes  into  or  out  of  the  well. 
Consequently,  it  is  the  self-consistent  agreement  between  drift-diffusion  and  thermionic 
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emission  transport  that  ultimately  determines  the  filling  of  the  well  and,  thus,  the  gain  of 
the  laser.  Inclusion  of  quantum  reflections  and  resonances  represents  a  straight  forward 
extension  of  this  treatment. 

Classical  transport  is,  however,  permitted  in  the  lateral  direction.  And  again,  the  rectan¬ 
gular  tesselation  enables  this  to  be  treated  correctly.  The  lateral  transport  is  treated  with 
drift-diffusion  theory.  The  divergence  in  the  carrier  flux  is  calculated  for  each  point  in  a 
transverse  cross-section  of  the  well,  but  these  divergences  are  added  together  to  determine 
the  continuity  equation  for  the  wavefunction  as  a  whole.  The  result  is  the  propagation 
along  the  quantum  well  of  entire  wavefunctions  that  represent  the  carrier  distribution, 
and  maintaining  the  shape  of  the  wavefunctions  as  carriers  propagate  ia  necessary  for  the 
correct  simulation  of  quantum  systems. 

The  importance  of  coupling  the  classical  and  quantum  regions  correctly  is  evident  in  fig¬ 
ure  3.  This  figure  shows  the  modulation  responses  for  two  different  lasers.  Elach  laser  has  < 

5000  A  AlGaAs  separate  confinement  regions  on  each  side  of  a  100  A  GaAs  quantum 
well.  However,  for  the  device  in  the  top  figure,  the  optical  confining  region  was  ungraded, 
whereas  for  the  bottom  device,  it  was  linearly  graded.  The  ungraded  device  shows  low 
frequency  roll-off  and  increased  saturation  of  the  resonant  peaks  when  compared  to  the 
graded  device.  The  roll-off  is  due  to  slow  carrier  drift  in  the  separate  confinement  region,  { 

and  the  gun  saturation  is  the  result  of  diffusive  diode  current.  Even  though  the  active 
regions  are  identical  in  the  two  devices,  the  optical  output  characteristics  are  very  dif¬ 
ferent  due  to  transport  in  the  surrounding  bulk  regions.  Consequently,  it  is  critical  that 
the  classical  regions  are  properly  coupled  to  the  quantum  region  in  order  to  accurately 
calculate  the  device  performance.  i 


TV.  Accounting  for  Carrier  Capture 

Although  the  method  described  above  is  an  effective  way  of  coupling  the  classical  regions 

of  a  laser  with  the  quantized  active  region,  there  is  an  element  missing  that  may  affect  4 

the  measured  characteristics  of  the  device.  We  discussed  the  way  in  which  thermionic 

emission  theory  is  used  to  determine  the  filling  of  the  well.  Thermionic  emission  theory 

assumes  that  the  carriers  on  one  side  of  a  heterojunction  are  in  thermal  equilibrium  with 

themselves  but  not  with  carriers  on  the  other  side  of  the  junction.  This  is  valid,  but  it  also 

assumes  that  once  a  carrier  passes  to  the  other  side  of  the  junction  it  immediately  relaxes  4 

and  is  in  thermal  equilibrium  with  carriers  on  that  side.  It  does  not  consider  the  finite 

time  it  takes  for  the  hot  injected  carrier  to  lose  its  excess  energy.  This  time  is  typically 

on  the  order  of  a  picosecond  or  less.  However,  due  to  the  self-consistent  agreement  of  the 

thermionic  emission  current  injected  into  the  well  and  the  drift-diffusion  current  in  the 

surrounding  regions,  any  finite  hot  carrier  concentration  in  the  well  can  lead  to  increased  4 

carrier  densities  in  the  separate  confinement  regions.  [7]  This  increase  in  carrier  density 

wiU  produce  a  diffusive  barrier  to  transport  and  increase  gain  saturation,  thereby  affecting 

the  laser  characteristics. 

The  coupling  of  the  classical  and  quantum  regions  described  above  can  be  extended  to  4 

account  for  the  relaxation  of  hot  quantum  carriers  into  bound  states.  Figure  4  schemat¬ 
ically  represents  this  extension.  As  before,  the  conduction  band  edge  and  the  ground 
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Figure  3:  The  modulation  reaponaes  for  two  different  100  A  GaAs  quantum  well  lasers. 
Each  device  has  5000  A  AlGaAs  separate  confinement  r^ons  on  each  side  of  the  quantum 
well.  However,  the  optical  confinement  r^km  for  the  top  device  is  ungraded  while  the 
confinement  region  for  the  bottom  device  is  linearly  graded. 
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Figure  4:  The  schematic  diagram  shows  a  transverse  croes-section  of  the  quantum  well. 
The  classical  and  quantum  mechanisms  that  determine  the  carrier  distributions  are  la¬ 
belled  along  with  the  scattering  between  quasi-bound  and  bound  quantum  states. 

state  wavefunction  for  a  transverse  cross-section  of  the  quantum  well  is  shown.  Like  the 
previous  method,  drift-diffusion  theory  is  used  to  determine  carrier  transport  up  to  the 
well,  and  thermionic  emission  theory  is  used  to  determine  injection  into  the  well  region. 
However,  now  carriers  are  not  injected  into  bound  states  but  rather  into  higher  energy, 
quasi-bound  states.  This  is  achieved  by  simply  using  thermionic  emission  with  zero  bar¬ 
rier  height.  The  quasi-bound  states  are  not  assumed  to  be  in  thermal  equilibrium  with 
the  bound  states  in  the  quantum  well.  They  have  a  separate  quasi-Fermi  level  to  deter¬ 
mine  their  occupancy.  The  quasi-bound  states  exchange  carriers  with  the  bound  states 
through  a  scattering  lifetime,  e.g.  as  calculated  in  [6].  The  result  wiU  be  a  finite  hot 
quantum  carrier  concentration,  the  consequence  of  which  can  only  be  determined  by  the 
self-consistent  solution  of  the  transport  equation  throughout  the  rest  of  the  device. 


V.  Conclusion 

The  characteristics  of  quantum  well  devices  are  most  critically  dependent  on  the  proper¬ 
ties  of  the  well,  which  can  be  accurately  treated  only  with  quantum  mechanical  methods. 
However,  the  device  performance  can  also  depend  strongly  on  transport  in  the  remainder 
of  the  device,  as  we  have  shown  with  the  modulation  response  of  semiconductor  lasers. 
Transport  in  these  other  regions  is  most  tractable  when  treated  with  classical  theories, 
but  this  leads  to  the  problem  of  coupling  the  quantum  treatment  of  the  active  region  with 
the  classical  treatment  of  the  surrounding  regions.  To  do  this  correctly  requires  complex 
mesoscopic  theories  which  are  not  easily  implemented  in  computer  simulation.  In  this  pa¬ 
per,  we  have  presented  a  model  which  serves  as  a  starting  point  for  the  numerical  solution 
of  this  difficult  problem. 
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Abstract 

We  study  the  simulation  of  quantum  cellular  automata  and  how  such  a  simulation  is  simplified 
by  the  features  of  FORTRAN-90.  We  demonstrate  the  use  of  second-quantized  operators  to 
write  the  cell  Hamiltonian  and  explain  the  Hartree  self-consistent  method  for  simulating  a 
many-cell  system.  Finally,  several  examples  of  simulated  QCA  devices  are  shown. 

I.  Introduction 

The  particular  quantum  system  we  simulate  consists  of  several  of  the  quantum  cells  shown  in 
Fig.  (la).  We  determine  the  ground  state  of  the  system  by  solving  the  time-independent 
Schrddinger  equation.  Each  cell  consists  of  five  coupled  quantum  dots  which  contain  a  total  of 
two  electrons.  The  cells  only  interact  with  each  other  Coulombically;  no  tunneling  is  allowed 
between  cells.  Since  the  state  of  each  cell  is  affected  by  its  nearest  neighbors,  we  call  such  a 
system  a  quantum  cellular  automaton  (QCA). 

Because  of  Coulombic  repulsion  between  the  two  electrons  in  each  cell,  the  charge  density 
exhibits  strongly  bistable  behavior.  The  ground  state  of  the  cell  is  therefore  in  one  of  the  two 
sutes  shown  in  Fig.  (lb).  Because  of  this  bistable  nature,  we  can  use  the  state  of  each  cell  to 
encode  binary  infcnmation.  We  define  a  cell  polarization  which  measures  to  what  extent  the 
cell  is  in  one  of  the  two  suble  states  shown  in  Fig.  (lb): 

(P1  +  P3)  -  (P2  +  P4) 

“  P0  +  P1  +  P2  +  P3  +  P4 


b) 

P  =  +l  P  =  -l 

Figtue  1.  Schematic  of  the  basic  five-site  cell,  (a)  The  geometry  of  the  cell  with  tB0.3meV,  t’st/10, 
and  ap20  nm.  (b)  Coulombic  repulsion  causes  the  elections  to  occupy  antipodal  sites  within  the 
cell.  The  two  bistable  states  have  cell  polarizations  of  Pa-t-I  and  IV-I(See  equation  (1)). 
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II.  S«conci-Quantiz«d  Hamiltonian 


We  define  a  second-quantized  annihilation  operator,  g,  which  destroys  an  electron  on  site  i 
with  q)in  o.  and  a  creation  operate,  i,  o.  which  creates  a  particle  on  site  i  with  spin  o.  The 
imduct  of  these  two  t^raun^,  g  =  gfi^  i,  g,  is  the  number  operator  for  that  site  and  spin. 

Using  these  operates  we  can  compactly  write  the  Hubbard-type  tight-binding  Hamiltonian  of 
a  single  isolated  cell: 


IV.  -  I 


This  Hamiltonian  includes  on-site  energies,  ninneling  between  sites,  on-site  charging  costs,  and 
Coulombic  repulsion  between  each  pair  of  sites.  The  interaction  with  neighboring  cells  alters 
the  on-site  energies  in  the  first  term. 


III.  Unique  Features  of  Fortran-90 

Fortran-90  supports  a  level  of  data  abstraction  sufficient  to  allow  direct  implementation  of 
these  second-quantized  operators  and  the  related  state  vectors.  We  have  created  user-defined 
types  representing  creation  and  annihilation  operators  and  many-electron  site  kets  and  bras.  We 
also  provide  functions  to  convert  between  these  types  and  to  define  the  effect  of  each  operator 
on  all  other  data  types. 

The  second  useful  feature  of  Fortran-90  for  our  purpose  is  operator  overloading.  This  allows  us 
to  use  an  t^rator  without  regard  to  the  data  types  it  acts  upon.  We  then  provide  an  interface 
that  invokes  the  appropriate  function  based  on  the  data  types  involved.  In  this  case,  the  action 
of  a  creation  or  annihilation  tqrerator  on  a  Dirac  ket  in  the  site  representation  is  specified.  The 
operation  is  “overloaded”  onto  the  normal  multiplication  symbol.  A  similar  overloading 
specifies  that  multiplication  of  a  Dirac  bra  and  ket  be  interpreted  as  the  inner  product  of  the 
state  vector. 

Fig.  (2)  shows  a  segment  of  Fortran-90  code  that  demonstrates  how  easily  quantum  mechanical 
expressions  can  be  written  as  Fortran  code  once  these  definitions  are  in  place.  The  code  is 
easily  understandable  because  the  level  of  data  abstraction  matches  the  level  of  quantum 
mechanical  abstraction. 


<>,l  ^  I  (otj  oOj  o  +  aty,  Ii2> 

*••/»  ®  n 


!  —  Hopping  terms  -- 
do  ist-l,NSTATES 
do  jst-l.NSTATES 
do  ispl-1,2 

H(il,i2)-H(il,i2)+lcet2bta(base»cet(il)  )•  <T0(ist)  •  4 
4ad(ist,  ispl)  *a  ( jst,  lapl)  *oaselcet  (12) ) 

H (il , i2) -H (il, i2) +ket2bra (baseket (il) ) * (TO (ist ) *  4 
4ad( jst, ispl) *a (ist, ispl) *baseket (12) ) 
end  do 
end  do 
end  do 


Figure  2.  Conversion  from  second-quantized  quantum  mechanical  expression  to  PORTRAN-90 
coi^  using  data  abstraction  techniques. 

IV.  Hartree  Self-Consistent  Calculations 

Since  electrons  are  not  allowed  to  tunnel  between  cells,  we  can  solve  for  the  ground  state  of 
each  cell  separately.  Such  intracellular  calculation  includes  exchange  and  correlation  effects 
exactly.  The  interaction  between  cells  is  included  using  a  Hartree  self-consistent  technique. 
Once  the  iterative  solution  of  the  system  has  converged,  the  system  is  in  an  eigenstate.  Use  of 
several  different  initial  conditions  and  comparison  of  the  eigen-energies  allows  us  to  determine 
the  overall  ground  state  of  the  system. 


V.  Application:  Quantum  Cellular  Automata 

We  have  used  this  scheme  to  simulate  many  arrangements  of  quantum  ceUs.  The  most 
fundamental  oi  these  calculations  is  shown  in  Fig.  (3a).  The  system  consists  of  two  cells  as 
shown  in  the  inset  The  charge  density  of  ceil  2  is  fixed,  and  the  induced  polarization  in  cell  1 
is  then  calculated.  This  is  repeated  for  many  values  of  P2  in  the  range  [-1.+1]  and  the  induced 
polarization  can  then  be  calculated  as  a  function  of  ^2-  cell-cell  response  function 
demonstrates  the  highly  nonlinear  and  bistable  nature  of  the  cell’s  response  to  its  neighbors. 

Fig.  (3b)  shows  a  similar  cell-cell  response  function  calculated  at  several  non-zero 
temperatures.  This  requires  calculating  the  thermal  expectation  value  of  the  polarization  over 
the  canonical  ensemble.  As  seen  in  the  figure,  the  nonlinearity  of  the  response  degrades  as  the 
temperature  increases. 
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Figure  3.  Cell-cdl  response  functions,  (a)  The  cell-cell  response  function  for  the  basic  five-site  cell 
at  zero  temperature.  The  solid  line  correqionds  to  the  antisymmetric  case,  and  the  dotted  line  to  the 
symmetric  case,  (b)  The  temperature  dependence  of  the  cell-cell  response  function. 

Fig.  (4)  shows  three  of  the  basic  QCA  devices.  Fig.  (4a)  demonstrates  that  even  a  weakly 

polarized  cell  can  drive  a  line  of  similar  cells  and  that  the  bistable  saturation  of  the  cell-cell 

response  will  return  the  signal  to  maximum  polarization  in  subsequent  cells.  Fig.  (4b) 

demonstrates  that  a  signal  will  propagate  through  a  right-angle  turn  without  degradation,  and 

Fig.  (4c)  shows  that  a  single  line  of  cells  can  fan  out  to  multiple  lines  and  maintain  signal 

integrity. 
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Figure  4.  Three  basic  QCA  devices,  (a)  A  line  of  cells  can  be  used  to  transmit  information 
frm  one  point  to  another,  (b)  The  sig^  is  uansmitted  correctly  around  a  comer,  (c)  A  single 
line  can  fan  out  correctly  to  multiple  lines  with  the  same  sig^.  These  are  not  schematic 
diagiams;  they  are  |dots  of  the  actual  results  of  the  self-consistent  calculation  of  the  ground 
stale  of  each  ^siem.  The  diameter  of  each  dot  is  proportional  to  the  charw  density  on  the  site. 
The  cells  with  heavy  borders  are  fixed;  all  others  are  free  to  react  to  the  fixed  cha^e. 


Fig.  (S)  shows  that  the  state  of  a  free  cell  matches  the  majority  of  its  fixed  neighbors.  This 
majority  voting  logic  can  provide  the  basis  of  a  new  computing  architecture.  If  one  of  the  fixed 
neighbcnrs  is  called  the  “program  line”,  such  an  arrangement  of  cells  can  be  interpreted  to  be  a 
programinable  AND/OR  gate.  The  program  line  detennines  the  nature  of  the  gate  (AND  vs. 
OR),  and  the  other  inputs  are  applied  to  the  gate  thus  defined. 
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Figure  S.  The  programmable  ANDA3R  gate.  Here,  the  program  line  is  set  to  one,  so  the  other  two 
inputs  are  being  applied  to  an  OR  gate.  In  each  of  the  four  cases,  the  output  is  one  if  either  of  the 
two  inputs  are  one.  This  is  a  plot  <m  the  result  of  a  self-consistent  ground  state  calculation. 


Finally,  Fig.  (6)  shows  how  to  cross  two  lines  of  cells  without  having  the  signals  interfere. 
Wire  crossings  are  very  important  for  implementation  of  devices  like  adders  and  exclusive-OR 
gates.  Such  a  quasi-two-dimensional  crossing  is  impossible  with  conventional  devices. 
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Figure  6.  One  way  to  cross  two  QCA  wires  without  signal  interference.  The  box  shows  the  extent 
of  the  crossing,  so  a  system  designer  can  simply  place  such  an  arrangement  of  cells  wherever  two 
wires  need  to  be  crowed.  This  is  the  result  of  a  self-consistent  ground  state  calculation  for  this 
system. 
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Abstract 

A  comprehensive  self-consistent  Schrodinger-Poisson  simulation  is  presented.  The  model 
takes  into  account  the  various  regions  of  reduced  dimensionality  throughout  a  particular 
structure  which  can  consist  of  OD,  ID,  2D  electron  gases  or  bulk.  The  feasibility  of  this 
level  of  simulation  is  made  possible  by  an  iterative  extraction  orthogonalization  method 
for  solving  the  Schrodinger  equation.  This  method  is  superior  to  conventional  eigenvalue 
solvers  since  it  generates  an  arbitrary  number  of  dgenstates  and  easily  couples  to  the 
Poisson  equation.  High  order  effects  such  as  exchange/correlation  and  single-electron 
charging  are  also  included  in  the  model.  Transport  is  treated  in  the  linear  response 
regime  and  used  to  investigate  the  Coulomb  blockade  oscillations  observed  by  Meirav[l]. 

I.  Introduction 

Recent  advances  in  semiconductor  fabrication  techniques  have  resulted  in  a  large  variety  of 
new  experimental  devices  exhibiting  quantum  effects  due  to  high  degrees  of  confinement. 
In  order  to  provide  a  realistic  model  for  the  physical  analysis  and  design  of  these  struc¬ 
tures,  we  have  developed  a  comprehensive  self-consistent  simulation  tool  that  merges  the 
statistical  and  quantum  mechanical  aspects  of  the  problem.  In  this  paper,  we  give  the  the¬ 
oretical  background  of  the  model  and  its  application  to  the  analysis  of  the  single-electron 
charging  effects  on  transport  through  a  quantiim  dot[l].  Previous  efforts  at  simulating 
quantum  structures  have  focused  on  localized  regions  in  the  device  which  exhibit  reduced 
degrees  of  dimensionality.  Although  these  simulations  have  yielded  much  information  on 
the  general  quantum-mechanical  properties  in  a  device  active  region[2,  3,  4],  their  failure 
to  integrate  the  dots  (OD), leads  (ID),  contacts  (2D),  and  bulk  re^ons  in  a  particular  geom¬ 
etry  prevents  them  from  achieving  good  quantitative  agreement  with  experimental  data. 
This  lack  of  coherence  has  been  due  in  large  part  to  computationid  time  constraints  in 
solving  the  Schrodinger  equation  over  an  arbitrarily  large  number  of  grid-points,  Nq.  Con¬ 
ventional  eigenvalue  solvers  typically  scale  as  Nq  and  generate  Nq  eigenvectors  thereby 
restricting  their  application  to  problems  with  small  grids.  In  a  gener^  quantum  device, 
however,  dgenvalue  problems  need  to  be  solved  in  all  regions  eidiibiting  reduced  dimen¬ 
sionality  and  require  a  large  Nq.  In  addition,  only  a  few  eigenenergies  are  often  necessary 
owing  to  the  small  number  of  occupied  «genstates  in  a  typical  device.  Clearly,  a  robust 
eigenvalue  solver  that  addresses  the  above  issues  and  allows  easy  self-consistent  coupling 
to  the  Poisson  equation  is  needed  to  bridge  the  gap  between  ’’localized”  modeling  and  full 
scale  device  simulations. 
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n.  Theoretical  Background 

We  solve  the  Schrodinger  equation  with  the  iterative  extraction  orthogonalization  method 
(lEOM)  that  propagates  a  set  of  eigenstates  according  to[5] 

|<}  =  iteration  (1) 

where  H  is  the  Hamiltonian  and  n  indicates  a  particular  eigenstate.  The  parameter  a  is 
selected  so  as  to  minimize  the  error  in  the  Taylor  expansion  of  the  exponential  operator 
exp{—aH).  In  general,  Ni  iterations  are  required  to  maintain  the  accuracy  of  the  Taylor 
expansion  and  eliminate  all  the  error  projections  of  the  basis  states  |m)  of  H  onto  the 
initial  guess  state  .  Gram-Schmidt  orthonormalization  is  performed  over  the  entire  set 
of  states  after  each  iteration  to  prevent  any  excited  states  from  collapsing  to  the  ground 
state.  This  procedure  also  eliminates  all  projections  of  states  with  m  <  n  such  that  after 
Ni  iterations, 

|tf‘)  ==  w+  E  (2) 

'  m=n+l  I  Vn) 

The  repeated  exponential  scalings  of  the  error  projections  and  Gram-Schmidt  orthonor¬ 
malizations  therefore  eventually  convert  into  a  pure  basis  state  |n).  The  convergence 
criterion  for  the  algorithm  is  determined  by  the  expression 


where  e  is  an  imposed  error  tolerance.  Clearly,  a  should  be  chosen  to  be  as  large  as  possible 
to  reduce  the  number  of  iterations  and  still  allow  a  high  degree  of  accuracy  in  the  Taylor 
expansion  of  exp{—aH).  Eq.  3  point!  out  an  ovnall  limitation  of  this  method  in  that 
the  number  of  iterations  required  to  achieve  convergence  scales  inversely  with  the  energy 
separation  between  eigenstates.  This  problem  can  be  alleviated  somewhat  by  selecting 
initial  states  with  the  appropriate  symmetry  such  that  the  error  projections  vanish  for 
eigenstates  with  opposite  parity.  Following  previous  time  dependent  treatments[6, 7, 8, 9], 
the  exponential  operator  in  Eq.  1  is  typically  cast  into  a  split  form  which  separates  its 
potential  and  kinetic  energy  components.  The  execution  time  of  each  iteration  depends 
on  how  efficiently  the  discretized  kinetic  energy  portion  can  be  solved.  For  the  rectilinear 
geometries  often  encountered  in  quantum  devices,  the  kinetic  energy  operator  can  be  fur¬ 
ther  separated  into  (z,  y,  z)  components  which  can  be  treated  independently  and  solved 
by  rapid  elliptical  solvers.  In  general,  each  application  of  the  propagator  scales  with  No 
and  the  Gram-Schmidt  algorithm  scales  with  NgN\  where  Ne  is  the  number  of  eigenen- 
ergies  required  in  a  particular  simulation.  If  Ne  is  relatively  small,  as  is  often  the  case  in 
nzmostructures,  NaN%  C  Nq  and  the  lEOM  shows  a  significant  improvement  over  con¬ 
ventional  eigenvalue  solvers.  The  chief  advantage  offered  by  the  rapid  convergence  of  this 
method  is  the  ability  to  accurately  treat  the  dimensionality  of  each  region  in  a  quantum 
device  out  to  an  iq>propriate  set  of  boundaries.  We  solve  the  3D  Schrodinger  equation 
in  the  quantum  dot  (OD)  regions,  the  2D  Schrodinger  equation  in  the  lead  (ID)  regions, 
and  ID  solutions  are  obtained  in  the  contact  (2D)  regions.  Semiclassical  solutions  are 


a) 


b) 


Figure  1:  (a)  Experimental  device  geometry  used  for  investigating  single  electron  charing  effects. 
A  negative  bias  on  the  gate  confines  charge  in  the  lateral  direction  to  form  ID  leads  and  a 
quantum  dot  between  the  restrictions.  Modulation  of  the  bottom  gate  bias  adds  electrons  to 
the  quantum  dot  one  at  a  time.(b)  Theoretical  electron  density  for  a  slice  taken  at  the  device 
active  region.  The  formation  of  a  quantum  dot  is  visible  as  an  island  of  charge  between  the  two 
ID  leads. 

used  for  carriers  that  do  not  exhibit  confinement  (holes)  and  regions  that  are  not  gov¬ 
erned  by  quantum  mechanics.  Charge  densities  are  constructed  by  scaling  the  amplitude 
squared  of  each  wavefunction  with  its  appropriate  statistical  weight[10].  The  distribution 
functions  used  in  the  scaling  of  the  OD  localized  states  are  derived  from  the  grand  canon¬ 
ical  ensemble  with  the  constraint  that  only  an  integer  number  N*  of  electrons  occupy  the 
quantum  dot.  N*  is  determined  for  a  given  gate  bias  by  free  energy  minimization[ll].  The 
potential  <^{f)  in  each  region  is  inherently  coupled  by  the  Poisson  equation  which  main¬ 
tains  continuity  of  ^(r)  and  its  first  derivative.  The  Schrodinger  and  Poisson  equations 
2ure  solved  self-consistently  by  a  modified  Newton  method  that  incorporates  a  line-search 
step  to  allow  convergence  at  low  temperatures.  In  addition,  exchange  and  correlation  ef¬ 
fects  are  self-consistently  treated  with  the  Kohn-Sham  approach[12]  using  Perdew-Zunger 
parameterization  for  the  correlation  potential[13].  Presently,  the  simulation  is  carried 
out  in  equilibrium  although  in  principle,  nonequilibrium  solutions  are  possible  for  accu¬ 
rate  evaluations  of  the  quantum-mechanical  current.  We  therefore  compute  the  quantum 
transport  properties  of  devices  in  the  linear  response  regime  which  is  typically  the  case 
for  Coulomb  blockade  measurements. 

III.  Results 

We  have  applied  the  simulation  to  the  analysis  of  single  electron  charing  effects  in  the  in¬ 
verted  semiconductor-insulator-semiconductor  structure  reported  by  Meirav  et  al[l].  Fig. 
1  shows  the  specific  device  geometry  used  in  our  investigation  along  with  a  charge  density 
surface  taken  at  the  device  active  region.  A  negative  bias  on  the  top  gate  confines  charge 


Fignie  2:  (a)  The  quan-lD  ttgenenergjes  and  thw  respective  localized  OD  eigenener^es.  The 
eigenenergies  associated  with  the  first  (scdid)  and  second  (dotted)  quasi- ID  states  are  the  only 
ones  occupied  {Ef  =  QmeV).  The  formation  of  sharp  barriers  leads  to  tunneling  into  and  out 
of  the  quantum  dot.  (b)  The  transport  characteristics  of  the  device  exhibit  sharp  thermally 
broadened  (T  =  SOmK,  Lo  =  .Sfim)  peaks  that  correspond  to  the  addition  of  a  single  electron 
to  the  dot  and  the  subsequent  lifting  of  the  Coulomb  blockade. 


in  the  lateral  direction  to  form  ID  leads  and  a  quantum  dot  between  the  restrictions. 
The  quantum  dot  is  visible  in  Fig.  lb  as  an  island  of  charge  between  the  two  ID  leads. 
In  addition,  the  presence  of  charge  in  the  leads  and  contacts  is  also  apparent.  As  the 
bias  on  the  backgate  is  increased,  electrons  are  added  to  the  dot  in  single  increments. 
This  results  from  the  electrostatic  Coulomb  repulsion  between  localized  electrons  which 
modifies  the  occupation  probability  of  each  level  and  allows  only  an  integer  number  of 
electrons  to  occupy  the  dot.  The  OD  region  is  delineated  by  sharp  barriers  separated 
by  a  distance  £„  which  bring  the  dot  into  weak  contact  with  the  leads  via  tunneling. 
Fig.  2a  shows  the  localized  mgenenergies  hi  the  dot  superimposed  over  their  respective 
quasi-lD  adiabatic  eigenenergies  obtained  by  extending  the  simulation  of  the  leads  into 
the  quantum  dot  region.  Although  two  quasi-lD  modes  have  access  to  the  dot,  the  upper 
channel  is  essentially  closed  owing  to  its  relatively  wide  tunnel  barriers  and  low  statistical 
weight.  The  device  should  therefore  exhibit  single  mode  characteristics.  Transport  in  the 
device  is  calculated  using  the  iqipropriate  expression  derived  for  the  Coulomb  blockade 
regime[14,  15].  Transmission  probabilities  are  evaluated  using  a  transfer  matrix  calcula¬ 
tion  based  on  the  quasi-lD  eigenenergy  as  a  function  of  distance.  The  charge  imbalance 
caused  by  single  electron  occupation  is  modeled  by  minimizing  the  free  energy  with  respect 
to  the  number  of  electrons.  The  difference  between  succesrive  free  enerpes  then  gives  the 
charging  energy  required  to  add  a  single  electron  to  the  dot  for  that  gate  bias[14].  TVans- 
port  characteristics  of  the  device  (Fig.  2)  exhibit  sharp  thermally  broadened  peaks  that 
correspond  to  the  addition  of  a  single  dectnm  to  the  dot  and  the  subseqMnt  lifting  of 
the  Coukmib  blockade.  The  peak  amplitude  and  periodicity  show  good  agreement  with 
the  experimental  datafl]. 


rV.  Conclusion 


In  summary,  we  have  demonstrated  the  viability  of  a  comprehensive  device-scale  simula¬ 
tion  tool  for  analyzing  quantum  devices.  The  lengthy  eigenvalue  calculation,  which  is  the 
chief  obstacle  to  this  level  of  simulation  via  conventional  methods,  has  been  overcome  by 
employing  the  rapidly  converging  iterative  extraction  orthogonalization  method  that  can 
treat  an  arbitrary  number  of  eigenenergies  and  wavefunctions.  Finally,  transport  charac¬ 
teristics  for  a  device  exhibiting  single-electron  charging  effects  were  obtained  and  shown 
to  exhibit  good  aigreement  with  experimental  findings. 
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Abstract 

We  introduce  the  planar  supercell  method  as  a  means  for  treating  3D  quantum  transport 
in  mesoscopic  tunnel  structures.  The  flexibility  of  the  method  allows  us  to  examine  a 
vairiety  of  physical  phenomena  relevant  to  quantum  transport,  including  alloy  disorder, 
interface  roughness,  defect  impurities,  and  OD,  ID,  and  2D  quantum  confinement,  in 
device  geometries  ranging  from  double  barrier  heterostnictures  to  quantum  wire  electron 
waveguides.  As  examples,  we  examined  quantum  transport  in  double  barrier  structures 
with  interface  roughness,  and  in  quantum  wire  electron  waveguides. 

I.  Introduction 

In  modeling  quantum  transport  in  semiconductor  resonant  tunneling  heterostructures, 
one  can  often  assume  perfect  periodicity  in  the  lateral  directions,  thereby  reducing  the 
mathematical  description  to  a  ID  problem  in  which  only  the  potential  variation  along  the 
growth  direction  need  be  considered.  However,  in  realistic  device  structures  we  need  to 
take  into  account  imperfections  such  as  interface  roughness,  impurities,  and  alloy  disorder 
which  are  incompatible  with  the  assumption  of  translational  invariance  in  the  parallel 
directions.  In  this  work,  we  introduce  the  planar  supercell  method  as  a  general  purpose 
model  for  treating  these  structural  imperfections.  The  model  is  designed  for  flexibility 
so  that  it  can  be  used  not  only  to  study  tunnel  structures  such  as  the  double  barrier 
heterostructure,  but  also  lower  dimensional  mesoscopic  devices  such  as  quantum  wire 
electron  waveguides.  Using  this  method,  we  have  studied  how  the  transport  properties 
of  double  barrier  heterostructures  are  influenced  by  interface  roughness.  We  also  briefly 
report  on  a  study  of  transport  in  quantum  wire  electron  waveguides. 

II.  Method 

We  use  a  planar  supercell  tight-binding  Hamiltonian  and  specify  a  structure  as  a  stack  of 
Nj  layers  perpendicular  to  the  z-direction,  with  e2ich  layer  containing  a  periodic  array  of 
rectangular  planar  supercells  of  IV,  x  sites.  Within  each  planar  supercell,  the  potential 
assumes  lateral  variations  as  dictated  by  the  device  geometry.  Figure  1  Ulustrates  a 
set  of  planar  supercell  stacks  used  in  simulating  double  barrier  structures  with  interface 
roughness.  Our  model  is  formadly  equivalent  to  the  one-band  effective  mass  equation  [1] 

-  +  V{x)^  =  Erl>,  (1) 

discretized  over  a  Cartesian  grid,  and  subject  to  periodic  boundary  conditions  (with  su- 
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percell  periodicity)  in  the  x-  and  y-directions,  and  open  boundary  conditions  in  the  z- 
direction.  The  transmission  coefficients  for  structures  described  by  the  planar  supercell 
stack  can  be  determined  by  the  direct  application  of  the  multiband  method  developed  by 
Ting  et  aL  [2] 


Side  View 


Interface  Layer 
Cross  Section 


Island 

Size 


4.9  dx 


10.6  d« 


Fig.  1.  A  set  of  plwar  supercell  stacks  used  in  simulating  double  barrier  structures  with 
interface  roughness.  Light  and  dark  areas  represent  sites  occupied  by  well  and  barrier 
materials,  respectively.  Cross  sections  shown  are  the  25  x  16  planar  supercells  representing 
the  rough  interfacial  layers  between  the  quantum  weU  and  the  second  barrier.  Note  that 
the  supercells  are  repeated  in  the  lateral  directions. 


111.  Applications 


To  illustrate  the  applications  of  the  plamar  supercell  method,  we  examine  the  effect  of 
interface  roughness  on  transmission  properties  of  double  barrier  structures,  and  effects  of 
neutrl  impurities  and  geometric  imperfections  in  quantum  wire  electron  waveguides. 

Interface  Roughness 

Figure  2  shows  transmission  spectra  near  the  n  =  1  resonance  for  a  set  of  GaAs/AlAs 
double  barriers  structures  with  interface  roughness  as  depicted  in  Figure  1.  For  each 
GaAs-AlAs  interface  on  the  incident  side,  a  50%  random  coverage  rough  interfacial  layer 
is  placed  between  the  pure  GaAs  layers  and  the  pure  AlAs  layers.  Random  configurations 
of  interfacial  layers  with  different  island  sizes  are  generated  with  a  simulated  annealing 


algorithm. [3]  Three  of  the  structures  have  average  island  sizes  (A)  of  28  A,  49  A,  and  106 
A;  an  additional  reference  structure  with  smooth  virtual  crystal  approximation  (VGA) 
Alo.5Gao.5A8  alloy  interfacial  layers  is  included  for  comparison.  We  note  that  the  spectra 
for  structures  with  rough  interfaces  show  a  series  of  satellite  peaks  not  present  in  the 
reference  structure  spectrum.  The  satellite  peak  strength  increases  with  island  size,  but 
peak  positions  are  approximately  the  same  for  all  three  rough  structures.  The  satellite 
peaks  are  the  result  of  interface  roughness  induced  k||  scattering  of  off-resonance  states 
into  on-reson^mce  states.  In  principle,  interface  roughness  can  scatter  a  given  k||  into  a 
continuous  range  of  k'||.  But  due  to  the  finite  supercell  size,  k||  can  scatter  only  into  a 
discrete  set  of  N,  x  iV,  parallel  k  vectors  differing  from  it  by  a  reciprocal  lattice  vector. 
Therefore  the  contributions  from  the  scattered  states  appear  as  individual  satellite  peaks. 
The  spacing  between  satellite  peaks  can  be  decreased  with  larger  supercell  sizes.  For 
sufficiently  large  superceUs,  the  spacings  would  become  smaller  than  the  resonance  peak 
width,  and  the  satellite  peaks  would  then  coalesce. 


GaAs/AlAs  DBH  w/  Rough  Interfaces 


Fig.  2.  Transmission  coefficients  for  a  set 
of  GaAs/AlAs  double  barrier  structures  with 
rough  interfaces  as  depicted  in  Fig.  1. 


GaAs/AlAs  DBH  w/  Rough  Interfaces 


Fig.  3.  Transmission  coefficients  for  structures 
sii^ar  to  those  shown  in  Fig.  2,  except  that 
the  island  sizes  are  larger. 


The  scattering  mechanism  described  above  accounts  for  how  interface  roughness  can  re¬ 
duce  peak-to-valley  ratios  in  current-voltage  characteristics  of  double  barrier  resonant 
tunneling  diodes  increasing  off-resonance  transmission  coefficients.  The  other  major 
effect  of  interface  roughness  is  the  broadening  of  resonance  peaks  through  wave  function 
localization.  In  Figure  3  we  show  transmission  spectra  for  a  set  of  structures  similar  to 
those  in  the  previous  example,  but  with  larger  island  sizes  of  112  A,  304  A,  and  424  A. 
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We  note  that  the  n  =  1  transmission  resonance  down-shifts  and  broadens  with  increasing 
island  size.  This  can  be  explained  in  terms  of  wave  function  localization.  The  presence  of 
the  rough  interfacial  layer  between  the  quantum  well  and  the  second  barrier  introduces 
well  width  fluctuation,  and  in  effect  divides  the  structure  into  wide- well  {Lw  =  13,  in 
this  example)  and  narrow- well  {Lw  =  12)  regions.  If  the  island  sizes  are  sufficiently  large 
(compared  to  the  de  Broglie  wavelength),  it  is  possible  to  have  quantum  well  states  whose 
wave  functions  are  laterally  localized  in  the  wide-well  regions.  Since  the  wide-weU  regions 
are  also  the  narrow-barrier  regions  (second  barrier  width  of  =  4  rather  than  5),  the 
transmission  resonance  associated  with  these  localized  states  should  be  down-shifted  (due 
to  the  wider  well),  and  broadened  (due  to  the  narrower  barrier). 

GaAs/AIAs  Electron  Waveguide  GaAs/AIAs  Electron  Waveguide 

L,a20nni  CiM*  S*cla4a4nm’  iSiiec^al  M.Snm  L,a20nm  XZMtaSSx3.Snni’  l7li17a.Mll  M.Snm 


Enww  (SV)  EnaiBy  (•V) 

Fig.  4.  TVansmission  coefficients  for  a  set  of  Fig.  5.  li’ansinission  coefficients  a  set  of  quan- 
quantum  wire  electron  waveguide  with  differ-  turn  wire  electron  waveguide  with  and  without 
ent  waveguide  opening  geometries.  neutral  impurity. 

Electron  Waveguide 

We  explore  the  transmission  properties  of  quantum  wire  electron  waveguides  with  the 
planar  superscell  method.  Using  discretization  sizes  of  =  dg  =  —  bA  and  16  x  16 

planar  supercells,  we  modeled  a  GaAs  quantum  wire  structure  200A  in  length,  40A  x  40A 
in  cross  section,  surrounded  on  the  sides  by  AlAs  walls,  and  the  ends  by  GaAs  electrodes. 
To  study  the  sensitivity  of  transport  properties  to  waveguide  opening  geometry,  we  have 
modeled  two  similar  structures.  The  first  has  slightly  wider  waveguide  openings  :  at  the 
two  ends  of  the  waveguide,  the  cross  section  is  widened  to  60A  x  60A  for  the  first  5  A 
(lengthwise),  and  SOA  x  50A  for  the  next  5  A;  the  rest  of  the  wire  remains  40A  x  40A  in 
cross  section.  The  other  structure  is  obtained  by  simply  capping  the  ends  of  the  original 
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structure  with  15  A  thick  AlAs  layers.  Figure  4  shows  the  transmission  spectra  for  the 
three  structures  described  above.  We  see  that  the  amplitude  of  the  transmission  coeffi¬ 
cients  and  the  resonance  positions  and  widths  vary  considerably  in  these  structures. 

In  the  final  example  we  study  the  effect  of  neutral  impurities  on  transport  properties 
of  electron  waveguides.  We  model  three  waveguide  structures  :  one  with  no  impurities, 
another  with  an  attractive  impurity  at  the  center  of  the  waveguide,  and  a  third  with 
an  impurity  near  the  waveguide  opening.  Figure  5  provides  a  picture  of  how  an  impu¬ 
rity  perturbs  the  waveguide  transmission  spectrum.  We  note  that  an  impurity  near  the 
waveguide  opening  does  not  strongly  perturb  the  lowest  two  resonances  since  these  modes 
have  small  probability  densities  near  the  waveguide  opening.  However,  an  impurity  at  the 
waveguide  center  down-shifts  and  narrows  the  lowest  (n=l)  resonance  while  leaving  the 
n=2  mode  relatively  unperturbed.  (Note  that  the  n=3  mode  has  also  been  down-shifted 
to  near  the  n=2  resonance.)  This  is  because  the  n=l  resonance  has  the  largest  probability 
density  at  the  center  of  the  waveguide,  while  the  n=2  mode  has  a  node  at  the  center. 

IV.  Summary 

We  discussed  the  planar  supercell  method  as  a  means  for  treating  3D  quantum  transport  in 
mesoscopic  tunnel  structures.  We  demonstrated  its  applications  with  studies  of  interface 
roughness  in  double  barrier  heterostructures,  and  geometric  imperfecti  us  in  electron 
waveguides.  In  addition  to  the  examples  discussed  here,  we  have  also  use  our  method 
to  study  the  effect  of  alloy  disorder  and  impurity  scattering  in  various  device  geometries. 
We  found  the  flexibility  of  the  method  to  be  extremely  useful  in  exploring  a  wide  range 
of  issues  relevant  to  the  operation  of  tunnel  devices. 
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Abstract 

We  extend  the  Quantum  Transmitting  Boundary  Method  (QTBM),  a  numerical  algorithm 
for  solving  the  two-dimensional  Schrddinger  equation  for  scattering  states,  to  include  an  ap¬ 
plied  magnetic  field  to  2D  systems.  We  apply  this  technique  to  simulate  the  magneto-transport 
of  a  periodically  corrugated  electron  channel.  The  conductance  quantization  of  such  structure 
is  recovered  when  the  channel  is  long.  The  index  of  this  quantization  is  a  non-monotonic 
function  of  energy. 


I.  Introduction 

For  structures  of  ultra-submicron  dimension,  electron  transport  is  in  the  quantum  regime.  The 
shapes  of  these  devices  and  the  potential  la  Escapes  can  be  flexibly  tailored,  for  example, 
by  electro-static  gating  on  top  of  the  2DEGs.  Understanding  the  details  of  transpon  in  these 
structures  inevitably  involves  numerical  solutions  of  the  2D  SchrOdinger  equation. 

Figure  1  schematically  shows  the  type  of  structures  we  consider;  the  region  of  interest  can 
be  partitioned  into  several  input  and  output  lead  regitms  (n^Q;,...)  and  a  "device  region" 
(flo).  where  scattering  states  are  to  be  solved.  A  perpendicular  magnetic  field  is  present. 
For  device  modeling,  a  mode  matching  approach  is  widely  adopted  [1,2].  The  problem  with 
this  method  is  that,  though  useful  for  the  most  simple  structures,  it  is  difficult  to  apply  for 
arbitrary  potential  profiles.  A  general  numerical  algorithm,  the  Quantum  Transmitting  Bound- 


FIGURE  1.  The  problem  geom¬ 
etry.  Fi  is  the  boundary  between 
the  device  region  fioaixl  lead  L 
Fo  denotes  all  the  other  bound¬ 
aries,  where  the  wavefunctions 
vanish.  The  applied  magnetic 
field  B  is  perpcndicuiar  to  the 
2D  system. 
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ary  Method  (QTBM)  developed  by  Lent  and  Kirkner  [3],  allows  rntm  complicated  potential 
inofile  and  can  cope  with  complicated  devices.  It  has  proven  to  be  an  efficient  technique  in 
device  modeling  in  [4]. 


II.  Model  And  Method 


We  extend  the  QTBM  to  include  the  effects  of  applied  magnetic  fields.  With  the  choice 
of  the  vectm  potential  in  Landau  gauge.  A  =  -Byx,  the  single-band  2D  effective-mass 
Schrbdinger  equation  becomes. 
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2m* 


,  tehBy 

V  + — 
m* 


e'^BW 
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(1) 


In  the  QTBM  algorithm,  we  expand  the  scattering  state  in  a  lead  region  as  a  superposition 
of  the  local  transverse  modes,  including  both  traveling  and  evanescent  ones.  The  modes  in 
the  lead  regions  are  obtained  by  solving  (Eq.  1)  as  a  quadratic  eigenvalue  problem  for  k  at 
given  energy  £,  using  the  form  ^(x,y)  =  Part  of  the  difficulty  of  the  problem  at 

non-zero  field  is  that  the  transverse  mode  eigenfunction,  the  x(y)’s,  are  not  orthogonal,  in 
contrast  to  the  zero  field  case.  Except  for  the  incident  modes,  the  complex  amplitude  of  each 
mode  is  an  unknown  for  which  we  must  solve.  In  the  Finite  Element  Method  scheme,  we 
are  able  to  properly  implement  the  boundary  conditions  and  obtain  the  numerical  solutions  of 
full  wavefunctions  in  the  whole  device  by  the  solution  of  a  single  linear  problem.  We  calcu¬ 
late  other  interested  physical  quantities  of  transport,  for  instance,  current  density  distribution 
and  transmission  coefficients  directly  from  the  wavefunction.  The  extended  QTBM  technique 
gives  us  the  capability  of  solving  the  magneto-transport  problem  for  arbitrarily  shaped  devices 
with  arbitrary  potentials,  as  that  of  Figure  1. 


( 


III.  Application  and  Results 


As  an  important  example,  we  present  the  results  of  this  technique  applied  to  a  corrugated 
quantum  channel  (Figure  2)  which  illustrates  the  recovery  of  conductance  quantization  for 
long  periodically  modulated  channels.  The  conducting  width  is  periodically  modulated  be- 
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FIGURE  2.  A  periodically  nodoiatcd  channel.  The  con- 
dncting  width  altematci  between  d  and  d-h.  A  nnit  cell 
b  aurhed  with  dashed  lines  and  Dhistrated.  A  magnetic 
field  is  applied  in -t-s-direction. 
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E/El 

FIGURE  3.  Numerical  results  for  the  structure  in  Figure  2  at  d/a^2.0,  hltt=0.6,  wla-0.4  and  ^24 J. 

(a)  TVansmission  coefficient  for  a  channel  of  N^SO  unit  cells.  The  incident  wave  is  in  tht  first  travel¬ 
ing  mode;  (b)  the  incident  is  in  the  second  mode;  (c)  the  incident  is  in  the  third  mode,  (d)  The  Lan- 
dauer  Conductance,  showing  the  quantization  in  a  non-monotonic  way  as  the  function  of  energy. 

(e)  The  bandstructure  of  an  infinite  modulated  channel.  The  index  of  the  conductance  plateau  in 
(d)  has  an  one-to-one  correspondence  to  the  number  of  positive-velocity  states  at  a  given  energy  in 
the  band  diagram. 

tween  d  and  d-h  with  a  periodicity  of  a  and  N  periods.  The  full  solution  of  wavefunctions  and 
scattering  matrix  is  obtained  for  one  unit  cell  (one  corrugation)  and  the  scattering  matrix  for 
the  structure  of  N  unit  cells  (periods)  is  calculated  using  a  cascading  method.  The  Landauer 
two-terminal  conductance  is  then  calculated  straightforwardly. 

Plotted  in  Figure  3  are  the  numerical  results  of  transmission  and  conductance  as  a  function 
of  energy  at  a  modestly  high  magnetic  field.  The  eneigy  is  expressed  in  units  of  first  bulk 
Landau  level,  El  =  (/iu)c)/2,  where  Uc  =  (eB)fm''  is  the  cyclotron  frequency;  the  magnetic 
field  is  expressed  in  dimensionless  form  by  0  =  (<Zo)//^,  where  =  h/{eB)  is  the  magnetic 
length.  For  the  results  presented  here,  the  dimensions  of  the  device  are  set  as  d/a  =  2.0, 
h/a  =  0.6,  wla  =  0.4.  The  effective  mass  is  chosen  to  be  m  *  /m  =  0.067,  which  is 
appropriate  for  GaAs.  Hard  wall  potentials  outside  the  channel  and  zero  inside  are  chosen 
for  simplicity. 
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FIGURE  4.  Particle  curreat  density  distribution  of  the  scattering  states  in  the  device  region  of  a  sin' 
gle  unit-cell  channel  at  Highlighted  is  the  potential  barrier  ‘linger”  which  causes  the 

mixing  of  edge  channels  in  the  device,  (a)  The  input  is  in  the  first  transverse  mode;  (b)  the  input  is 
in  the  second  transverse  mode. 


Drawn  in  Figure  3<a),  (b)  and  (c)  are  the  mode  transmission  coefficients,  Ti,T2,T3,  when 
the  input  is  in  the  first,  second,  and  third  transverse  mode  respectively.  The  channel  has  50 
periods  of  modulation.  We  see  a  complicated  pattern  of  mode  transmission  as  a  function  of 
energy  and  none  of  the  single  mode  transmission  coefficients  is  quantized. 

In  Figure  3(d)  we  plot  the  two-terminal  Landauer  conductance,  G  =  x  shows  that 

for  the  long  channel  of  SO  periods,  although  none  of  the  individual  mode  transmissions  shows 
the  quantization,  the  total  conductance  as  function  of  energy  is  essentially  quantized.  The 
height  of  the  plateaus  is  a  non-monotonic  function  of  energy. 

In  comparison,  we  also  calculated  the  bandstructure  of  the  infinite  periodic  system  and  plot  it 
in  the  reduced  Brillouin  zone(Figure  3(e)).  We  find  the  index  of  each  conductance  plateau  has 
a  one-to-one  correspondence  to  the  number  of  positive-velocity  (slope)  bands  in  the  band- 
structure  of  the  infinite  problem  [S].  The  shadings  in  Figure  3(d)  and  3(e)  illustrate  this 
ctNTCspondence.  Full  understanding  of  the  variations  in  conductance  from  the  exact  integers 
is  for  further  study.  One  reason  is  that  the  channel,  although  long,  is  still  finite. 

For  an  incident  energy  of  E/Ei,=3.l5,  marked  with  the  dashed  line  in  Figure  3(a),  there 
are  two  allowed  traveling  modes  in  the  lead  regitm.  Hgures  4  illustrates  the  particle  current 
density  distributions  of  a  one-cell  channel.  The  modulation  causes  the  mixing  of  edge  states 
in  the  device  region  and  makes  the  single  mode  tnuismitting  pattern  complicated.  Shown  in 
Figure  4(a)  is  the  particle  current  density  for  elections  incident  in  the  first  mode  (edge-state). 
The  tqrposite-going  edge  states  on  t^rposite  side  walls  of  the  channel  are  weakly  coupled 
in  one  unit  cell,  so  the  resulting  transmission  coefficient  is  risO.96.  Figure  4(b)  shows  the 
current  for  electrons  incident  in  the  second  mode.  The  transmission  coefficient  is  Tj^O.S? 
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for  one  unit  cell.  For  the  long  channel  of  SO  unit  ceils,  we  have  Ti  =  0.34  and  Tj  =  0.66 
which  sum  to  unity  for  the  total  transmission  and  make  the  plateau  index  of  the  quantized 
conductance  1. 


VI.  Summary 

We  have  extended  the  QTBM  algorithm  to  include  an  applied  magnetic  field  to  2D  sys¬ 
tems  for  solving  cunent  carrying  states  governed  by  effective  2D  SchrOdinger  equation.  In 
this  algorithm,  we  have  acquired  the  capability  of  modeling  of  magneto-transport  with  com¬ 
plicated  potendal  prtrfiles  and  arbitrarily  shaped  devices.  This  technique  is  applied  to  studying 
of  a  periodically  corrugated  quantum  channel.  The  results  illustrate  that  the  total  conductance 
quantization  recovers  when  the  modulated  channel  is  sufficiently  long.  The  plateau  index  is 
a  non-monotonic  function  of  energy.  However,  the  individual  mode  transmissions  are  not 
quantized. 
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Abstract 


We  discuss  a  numerical  method  for  computing  the  electronic  scattering  states  for  a  fully 
two-dimensional  open  boundary  scattering  domain.  The  scattering  states  may  then  be 
used  to  obtain  the  local  electron  density  in  the  vicinity  of  the  scatterers  which  is  necessary 
for  a  numerical  study  of  the  residual  resistivity  dipole  and  the  “electron  wind”  force 
relevant  to  electromigration.  The  scattering  states  may  also  be  used  to  calculate  the  local 
electron  density  of  states  which  has  recently  been  directly  imaged  by  STM  experiments  on 
the  surface  of  copper.  Our  numerical  method  is  based  upon  the  partial  wave  expansion  of 
the  known  asymptotic  form  of  the  wave-function  on  the  solution  domain  boundary.  The 
wave-function  and  the  normal  derivative  are  then  matched  on  the  boundary  resulting  in 
a  linear  system  of  equations. 

I.  Introduction 

The  large  volume  of  recent  literature  on  the  study  of  electromigration  incre2tsingly  em¬ 
phasizes  the  importance  of  the  local  non-uniform  fields  near  scattering  centers.  It  is  clear 
that  a  detailed  understanding  of  the  local  fields  near  scatterers  is  needed  to  understand 
phenomena  in  which  the  residual  resistivity  dipole  (RRD)  [1-2]  and  Friedel-oscillations 
[3]  play  an  important  role.  Such  a  local  field  treatment  is  used  in  the  application  of  the 
Kubo  lineau’-response  formalism  to  compute  the  “electron  wind”  force  experienced  by  a 
scatterer  in  electromigration  [4-7]. 

The  importance  of  local  field  effects  near  scattering  centers  is  most  clearly  and  elegantly 
demonstrated  by  the  recently  published  scanning  tunnelling  microscope  (STM)  experi¬ 
ments  performed  on  the  Cu(lll)  surface  by  Cronunie  et  al.  [8-9].  In  these  experiments, 
the  local  density  of  states  (LDOS)  of  the  two-dimensional  electron  gas  (2DEG)  on  the 
stepped  surface  of  Cu(lll)  was  directly  probed  by  and  STM  tip  at  low  temperature.  The 
images  of  the  LDOS  revealed  standing-wave  patterns  due  to  electron  scattering  from  step 
edges  and  defects. 

In  this  paper,  a  numerical  method  is  presented  in  which  the  two-dimensional  electronic 
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scattering  states  are  obtained  by  solving  the  effective  mass  Schrodinger  equation  over  a 
2D  domain  with  an  open  boundary.  The  method  uses  the  paritd  wave  expansion  of  the 
known  asymptotic  solution  and  matches  the  wave-function  and  its  normal  derivative  on 
the  boundary.  The  resulting  system  of  linear  equations  can  be  solved  by  the  finite  element 
method.  These  scattering  states  may  then  be  used  to  compute  the  electron  density  and 
the  LDOS  inside  the  scattering  domain.  A  self-consistent  treatment  would  require  an 
iterative  solution  of  scattering  states  using  a  Hartree  potential  [10]. 

The  physical  model  for  the  partial  wave  expansion  is  presented  in  section  11.  Section  111 
presents  the  finite  element  formulation  of  the  problem.  An  example  solution  for  a  repulsive 
scatterer  is  presented  in  section  IV.  In  conclusion,  we  summarize  our  method  in  Section 
V. 

II.  Model 

We  view  electron  transport  in  the  spirit  of  Landauer’s  picture  for  the  residual  resistivity 
dipole  (RRD)  [1-2]  in  which  the  incident  carrier  flux  is  identified  as  the  fundamental 
driving  transport  quantity  in  a  “jellium”  model  with  a  background  scattering  time  r 
which  gives  rise  to  the  bulk  resistivity  po  =  m*/ne^r,  where  n  is  the  electron  density 
and  m*  is  the  electron  effective  mass.  As  is  schematically  depicted  in  Fig.  1,  the  incident 
electron  flux  is  elastically  scattered  by  the  defect,  schematically  shown  as  the  shaded  spot, 
and  is  partially  transmitted  and  partially  reflected.  For  the  metallic  “jellium”  model,  the 
problem  domain  is  assumed  to  be  in  the  ballistic  regime  and  scattering  is  assumed  to 
take  place  within  the  region  close  to  the  scatterer  such  that  'Iirkpl  »  1,  where  kf  is  the 
wave  vector  for  the  electrons  at  the  Fermi-energy,  and  /  is  the  mean-free  path  given  by 
/  =  fikfr/m’.  Outside  the  scattering  region  the  domain  is  assumed  to  uniformly  extend 
to  infinity. 


FIGURE  1:  A  schematic  diagram  of  the  2-D  scattering  domain  with  an  elastic  scatterer 
at  the  center.  The  solution  domain  is  represented  by  the  hatched  region. 

Within  this  “jellium”  model,  the  self-consistent  electronic  states  can  be  explicitly  obtained 
by  solving  the  single  electron  effective  mass  Schrodinger  equation, 

~  +  (Vr  +  Vh  +  =  E'f’E{r,0)', 


(1) 


where  V/i(r)  is  the  potential  of  the  scatterer,  Vh  is  the  self-consistent  Hartree  potential 
which  accounts  for  many-body  Coulombic  interactions,  and  Vsc  is  the  electrostatic  po¬ 
tential  associated  with  self-consistent  screening.  Exchange  and  correlation  may  also  be 
included  within  a  local  density  approximation  [11].  The  z  dependence  has  been  dropped 
by  the  assumption  that  only  the  first  subband  of  the  2DEG  is  occupied,  reducing  the 
problem  to  circular-polju'  corrdinates. 

We  assume  an  incident  plane  wave  of  the  form  exp(tk  •  r).  The  asymptotic  solution  of 
this  equation  in  the  region  where  the  scattering  potential  is  negligible  (the  region  outside 
the  domain)  is  given  by 

V’(r>flo,0)=  L  (2) 


where  we  have  used  the  Jacobi-Anger  expansion  of  the  incident  wave,  and  Jm  is  the 
Bessel  function  of  the  first  kind  with  a  known  incident  amplitude  a,  and  is  the 
Hankel  function  for  the  outgoing  scattered  wave  with  the  unknown  amplitudes  bm- 


Similar  to  the  development  of  the  quantum  transmitting  boundary  method  for  quasi-lD 
transport  [12],  the  2-D  transport  boundary  condition  is  developed  from  the  orthogonality 
of  the  angular  modes  over  the  interval  6  =  (0,2jr),  which  are  used  to  expand  the  unknown 
coefficients,  on  the  boundary  of  the  domain. 


'Hii'(kRo) 


HS'ikfk) 


Using  this  expamsion  for  the  coefficients,  the  normal  derivative  of  the  wave-function 
on  the  boundary  can  be  obtained, 


dr  Ifio 


=  ak  Z  i^'Ij'JkRo)- 
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(4) 


where  the  primes  on  the  Bessel  and  Hankel  functions  indicate  derivatives  with  respect  to 
kr.  Equations  (4)  and  (5)  form  the  basis  for  the  open  2-D  scattering  boundary  conditions 
which  can  be  incorporated  into  the  finite  element  method.  It  is  important  to  note  that 
the  orthogonality  of  the  angular  functions  requires  a  circular  domain  boundau'y. 


III.  Numerical  Method 

The  finite  element  method  is  used  to  solve  the  2-D  effective  mass  Schrodinger  equation 
on  the  discretized  domain  schematically  shown  by  the  grided  region  in  Fig.  1.  The  region 
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outside  the  problem  domain  is  assumed  to  extend  to  infinity  as  described  in  section  II. 
The  finite  element  method  is  used  to  linearize  the  Schrodinger  equation,  resulting  in  the 
weak  variational  form 


-T 

u 


u  +  •  NJn)  u 


2  Jao  m"  dr 


(6) 


where  u  is  the  vector  of  nodal  values  for  the  arbitrary  test  function  u  is  the  vector 
of  unknown  nodal  values  for  the  wave  function  0,  N  is  the  vector  of  global  orthonormal 
finite  element  shape  functions,  B  is  the  matrix  of  spatial  derivatives  of  the  N  vector  of 
shape  functions,  and  n^a  is  the  unit  normal  to  the  domain  boundary,  dfl.  The  result 
for  the  derivative  of  the  wave-function  on  the  domain  boundary  developed  in  section  II 
is  inserted  above  to  determine  the  right  hand  side  surface  terms.  The  resulting  linear 
system  of  equations  has  the  form 

*J(T  -h  V  C)u  =  u'^P,  (7) 


where 

T  = 

V  = 

C  = 

P  = 

The  infinite  sums  may  be  truncated  to  include  only  the  more  relevant  low  angular  mo¬ 
mentum  modes  with  minimal  loss  in  accuracy.  The  resulting  linear  system  may  be  solved 
by  standard  LU-decomposition  and  back  substitution  using  sparse  matrix  methods  for 
the  efficient  use  of  both  memory  and  cpu  resources. 
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rV.  A  Repulsive  Scatterer 

Our  preliminary  results,  neglecting  electron-electron  scattering  effects  contained  in  V// 
and  Vsc,  are  presented  for  the  structure  schematically  shown  in  Fig.  1.  The  infinite  re¬ 
pulsive  scatterer  is  centered  in  the  solution  domain.  The  mesh  generated  for  this  example 
contains  10801  nodes  connected  by  21500  triangular  elements.  Both  skyline  storage  and 
bandwidth  optimization  techniques  were  employed  for  an  efficient  computational  solu¬ 
tion  of  the  resulting  unsymmetric  linear  system  of  equations.  The  real  part  of  the  wave 
function  is  presented  in  Fig.  2.  The  radial  and  angular  modes  are  clearly  visible  in  the 
scattered  wave. 
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FIGURE  2:  The  real  part 
of  the  scattered  wave  func¬ 
tion  for  an  incident  plane 
wave  scatterd  by  a  repulsive 
central  field.  Both  the  in¬ 
cident  plane  wave  and  the 
radially  scattered  contribu¬ 
tions  are  clearly  visible.  The 
mesh  consists  of  10801  nodes 
with  21500  triangular  ele¬ 
ments. 

V.  Summary 

We  have  presented  a  numerical  method  to  solve  the  2-D  effective  mass  Schrodinger  equa¬ 
tion  for  an  open  boundary  scattering  problem.  The  method  uses  partial  wave  expansion 
to  fully  specify  the  normal  derivative  of  the  wave-function  on  the  boundary.  The  finite 
element  method  is  used  to  discretize  the  Schrodinger  equation.  The  partial  wave  bound¬ 
ary  conditions  are  used  to  fully  specify  the  problem  which  reduces  to  a  linear  system  of 
equations  which  can  be  solved  for  the  scattering  states.  The  scattering  states  may  then  be 
used  to  compute  the  local  density  of  states  and  the  electron  density  inside  the  scattering 
region. 
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Abstract 

We  investigate  the  line  shape  of  the  transmission  probability  in  quantum  waveguides 
with  resonantly-coupled  cavities.  The  lifetime  of  the  quasi-bound  states  is  extracted 
from  the  asymmetrical  transmission  amplitude  on  the  real-energy  axis.  The  reso¬ 
nance/antiresonance  feature  in  the  vicinity  of  each  quasi-bound  state  is  characterized 
by  a  zero-pole  pair  in  the  complex-energy  plsme.  We  develop  a  generalization  of  the 
familiar  symmetrical  Lorentzian  line  shape,  and  discuss  it  in  terms  of  Fano  resonances. 

I.  Introduction 

A  common  computational  problem  in  quantum  transport  is  to  find  the  lifetimes  of  quasi¬ 
bound  states  from  the  transmission  peaks.  The  underlying  theory  is  that  each  quasi¬ 
bound  state  leads  to  a  pole  of  the  propagator  (and  the  transmission  amplitude)  in  the 
complex-energy  plane.  If  this  pole  is  sufficiently  close  to  the  real-energy  ^u(is,  it  will 
result  in  a  resonance  maximum  of  the  observed  transmission  coefficient.  A  well  under¬ 
stood  problem  is  double-barrier  resonant  tunneling,  where  the  lifetimes  of  the  quantum 
well  states  may  be  extracted  from  the  width  of  Lorentzian-shaped  transmission  peaks.  A 
less  understood  problem  is  electronic  transport  in  quantum  waveguides  with  resonantly- 
coupled  cavities  [1].  It  is  known  for  these  structures  that  the  resonator  states  lead  to 
resonance/antiresonance  features  [2,  3],  but  their  detailed  line  shape  has  not  been  in¬ 
vestigated  so  far.  In  this  paper,  we  present  a  theory  of  the  line  shape  for  transmission 
in  resonantly-coupled  quantum  waveguide,  and  we  provide  a  computational  method  to 
extract  the  lifetimes  of  the  corresponding  quasi-bound  states. 

II.  Poles  and  Zeros 

For  double- barrier  resonant  tunneling  (DBRT),  it  is  well  known  that  the  resonant  trans¬ 
mission  phenomena  are  related  to  the  quasi-bound  states  in  the  quantum  well  region. 
Based  on  the  Breit-Wigner  formalism,  a  quasi-bound  state  at  energy  Ep  and  decay  time 
T  =  A/2r  yields  a  simple  pole  in  the  transmission  amplitude  t{z)  at  the  complex-energy 
z  =  Ep  —  iT  (4), 


f(z)  ~ 


z-(£p-tr)- 


If  this  pole  is  sufficiently  close  to  the  real-energy  axis  such  that  the  effect  of  other  poles 
can  be  neglected,  the  transmission  probability,  T(£)  =  |f(£)|^,  for  a  physical  energy  on 
the  real-energy  axis,  E,  is  given  by, 

which  gives  rise  to  a  transmission  resonance  with  a  Lorentzian  shape.  It  is  an  easy 
matter  to  extract  the  lifetime  of  the  quantum  well  states  from  the  width  of  the  observed 
transmission  peak. 

An  example  of  the  transmission  amplitude  for  DBRT  is  presented  in  Figs.  1(a)  amd 
(c),  where  the  double- barrier  structure  and  the  transmission  channnel  are  schematically 
shown  in  the  inset  of  Fig.  1(a).  In  this  example,  the  barrier  is  0.2  eV  high  and  3nm 
thick,  the  separation  of  the  barrier  is  20  nm.  We  see  the  poles  in  the  complex-energy 
plane  are  clearly  visible  in  the  contour  plot  of  the  absolute  value  of  f(r)  in  Fig.  1(c).  The 
Lorentzian  line  shape  of  the  transmission  probability  is  shown  in  Fig.  1(a). 

Figure  1.  Comparison  of  the 
structure  of  the  transmission 
amplitude  in  the  complex  en¬ 
ergy  plane  for  double-barrier 
resonant  tunneling  (poles) 
and  t-stubs  (zero-pole  pairs). 
For  DBRT,  (a)  shows  the 
transmission  probability  on 
the  real-energy  axis,  and  (c) 
gives  a  contour  plot  of  the  ab^ 
solute  value  of  the  transmis¬ 
sion  amplitude  in  the  com¬ 
plex  energy  plane.  For  the 
t-stub  structure,  the  corre¬ 
sponding  plots  are  shown  in 
(b)  and  (d),  respectively. 

Recently,  much  work  has  been  done  on  transmission  in  resonantly-coupled  quantum 
waveguide  systems,  and  rich  features  of  the  transmission  coefficient  have  been  found 
(resonance/antiresonance)  [5].  We  have  shown  that  the  transmission  amplitude  exhibits 
zero-pole  pairs  in  the  complex-energy  plane  for  this  kind  of  the  structure  [6].  As  a  con- 
squence,  zero-pole  pairs  lead  to  asymmetrical  transmission  resonance/antiresonance  fea¬ 
tures  on  the  real-energy  axis.  As  an  example,  we  show  the  behavior  for  a  t-stub  structure, 
which  is  schematically  shown  in  the  inset  of  Fig.  1(b).  It  consists  of  a  main  transmission 
channel  and  a  dangling  wire  of  length  L  =  10  nm.  Zero-pole  pairs  are  clearly  visible  from 
the  contour  plot  of  the  absolute  value  of  t(z:)  in  Fig.  1(d). 

III.  Line  Shape 

Based  on  the  zero-pole  pair  nature  of  the  resonances  in  quantum  waveguide  structures, 
we  make  the  following  ansatz  for  the  transmission  amplitude  in  the  vicinity  of  each  quasi- 
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( 


bound  stAte, 


t{z)  ~ 


U  -  Eo) 
z-{Ep-ir)  ■ 


(3) 

( 


Here,  Eq  and  (£/>— tT)  are  the  positions  of  the  transmission  zero  and  the  pole,  respectively. 

The  lifetime  of  the  quasi-bound  state  is  given  by  r  =  h/(2r),  as  for  the  case  of  double¬ 
barrier  resonant  tunneling.  Again,  the  transmission  probability  on  the  re2d-energy  axis 
is  given  by  T{E)  =  (<(£)(*,  and  the  proportionality  constant  in  eqn.  (3)  is  determined 
by  assuming  peaks  with  unity  transmission  which  are  known  to  occur  in  symmetrical  < 

waveguide  systems  [6].  A  unity  transmission  peak  at  energy  E\  provides  two  constraints 
for  T{E),  namely  T{Ei)  =  1  and  |£,  =  0.  It  is  an  easy  matter  to  show  that, 


r(£)  = 


F’ 

(E-Eo)^ 

[(Ep-Eo)»-hr2j 

[(E-Ep)»-hF2j 

(4) 


( 


The  above  expression  gives  the  line  shape  of  the  transmission  probability  for  resonantly- 
coupled  quantum  waveguides  in  terms  of  three  parameters,  namely  the  energy  of  the 
transmission  zero,  Eq,  the  energy  of  the  resonant  state,  Ep  (the  real  part  of  the  pole 
energy),  and  the  inverse  lifetime  of  the  state,  F  (the  imaginary  part  of  the  pole  energy). 
Note  that  (4)  produces  an  asymmetrical  line  with  a  resonance/ antiresonance  behavior. 
Such  asymmetrical  line  shapes  have  previously  been  noted  in  atomic  and  molecular  physics 
[7].  These  so-called  Fano  resonances  are  know  to  occur  when  a  bound  state  is  coupled  to 
a  continuum  of  states,  thereby  leading  to  resonance  phenomena  [8]. 

In  his  original  paper  [7],  Fano,  after  somewhat  lengthy  derivations,  found  that  the  au¬ 
toionization  cross  section  could  be  parameterized  by  (9-i-€)*/(H-«^),  where  e  is  a  reduced 
energy  (it  is  defined  as  e  =  (£  -  EfUi)!?,  where  Efte$  is  the  energy  of  the  resonant  state) 
and  q  is  treated  as  a  parameter  (it  is  a  complicated  expression  involving  matrix  elements). 
We  note  that  this  is  the  same  line  shape  as  our  eqn.  (4)  by  making  the  following  substitu¬ 
tions,  t  =  {E  ~  Ep)/r  and  q  =  (Ep  —  £o)/F.  Comparing  our  approach  to  F^mo’s  [7],  we 
note  that  c  has  a  similar  meaning  where  Ep,  the  real-part  of  the  pole  energy,  corresponds 
to  Epe,,  the  energy  of  the  resonant  state.  For  the  parameter  q,  our  approach  yields  a 
simple  expression  which  could  not  have  been  inferred  from  Ref.  [7].  Apparently,  Fano’s 
line  shape  corresponds  to  a  zero-pole  structure  in  the  complex-energy  plane,  a  fact  which 
has  not  been  noted  before. 


Given  a  certain  transmission  curve,  we  now  can  fit  each  resonance/antiresonance  feature 
to  obtain  the  lifetime  of  the  corresponding  quasi-bound  state.  Using  the  known  energies 
of  the  transmission  zero,  Eo,  and  transmission  one,  E\,  we  can  find, 

P  _(£o+£i)  ,  \/(£o  -  El)*  -  4r* 

Ep - 2 - ± - 2 - .  (5) 

The  choice  of  the  sign  in  the  above  equation  determines  whether  Ep  >  Eo  or  Ep  <  Eo- 
With  this,  the  only  unknown  parameter  is  F  which  may  be  used  to  obtain  the  best  fit  of 
the  theoretical  line  shape  (4)  to  the  given  transmission  curve.  We  seek  the  best  fit  in  the 
sense  of  the  least  mean  square  error. 
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IV.  Results 


We  now  present  several  examples  to  fit 
the  lineshapes  of  resonance  and  antires¬ 
onance  pairs.  Figure  2  presents  fits  of 
the  resonance/antiresonace  line  shapes  for 
a  family  of  so-called  weakly  coupled  t- 
stubs  [6],  which  are  schematically  shown 
in  the  insets.  In  Figs.  2(a),  (b),  and 
(c),  the  resonant  stubs  are  separated  from 
the  main  transmission  channel  by  a  tun¬ 
neling  barrier  of  length  ^  =  1.0  nm  and 
height  =  0.5  eV,  Vq  =  1.0  eV,  and 
Vo  =  2.0  eV,  respectively.  In  each  case,  we 
show  3  quasi-bound  states  which  lead  to 
zero-one  features  in  the  transmission  prob¬ 
ability,  and  which  are  labeled  in  the  plots. 
Figures  2(a),  (/3),  and  (7),  show  the  fitted 
line  shape  for  the  resonance  numbered  3  of 
cases  (a),  (b),  and  (c),  respectively.  The 
fit  is  shown  by  the  dotted  line,  and  the 
curve  to  be  fitted  by  the  solid  line. 


EmtstUV)  EauvCtV) 


Figure  3.  Example  of  fits  using  the  line  shape 
(4)  for  a  t-stub  with  a  double  barrier  structure 
on  the  transmission  channel  shown  in  the  inset  to 
(a).  The  fits  of  resonances  1  -  4  are  shown  in  (b) 
-  (e),  respectively. 
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Figure  2-  Examples  of  fits  using  the  line  shape  (4) 
for  the  weakly-coupled  t-stub  structures  shown  in 
the  insets.  The  fits  to  the  third  resonant  state  of 
(a),  (b),  and  (c)  are  given  in  (a),  (/?),  and  (7), 
respectively.  The  fits  are  shown  by  the  dotted  line, 
and  the  curve  to  be  fitted  by  the  solid  line. 

In  figure  3,  we  present  another  example 
which  shows  transmission  for  a  t-stub  in 
addition  to  double-barrier  resonant  tunnel¬ 
ing  on  the  main  transmission  channel.  A 
schematic  drawing  of  this  waveguide  struc¬ 
ture  is  displayed  in  the  inset,  and  the  two 
tunneling  barriers  have  a  thickness  of  1  nm, 
height  of  0.5  eV',  and  separation  of  4  nm. 
Figure  3(a)  shows  the  transmission  proba¬ 
bility  on  the  real-energy  axis.  Figures  3(b)  - 
(e)  display  the  fits  according  to  our  ansatz, 
eqn.  (3),  for  resonances  1-4,  respectively. 
Again,  the  fit  is  shown  by  the  dotted  line, 
and  the  curve  to  be  fitted  by  the  solid  line. 
It  appears  that  the  zero-pole  character  of 
each  quasi-bound  state,  les^ls  to  extremely 
good  fits  of  the  transmission  probability  in 
the  vicinity  of  each  resonance. 
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In  general,  the  locations  of  the  poles  and  the  zeros  on  the  real-energy  axis  are  not  the 
same,  i.e.  Ep  ^  £^.  It  is  this  fact  that  gives  rise  to  the  asymmetric  line  shape,  eqn.  (4). 
Note  that  from  eqn.  (5),  the  position  of  the  pole,  Ep,  is  always  between  the  positions  of 
the  transmission  zero  and  one,  Eq  and  Ei.  If  the  pole  and  the  zero  occur  at  the  same  real 
energy,  i.e.  Ep  =  Eq,  then  eqn.  (4)  yields  a  symmetric  line  shape, 

J  \Ep=Ec  (E-Ep^  +  r^' 

The  above  expression  describes  a  Lorentzian-shaped  reflection  line. 

In  recent  work  [3],  Price  has  pointed  out  that  a  resonant  quasi-bound  state  can  give  rise 
to  either  Lorentzian-shaped  transmission  or  reflection  peaks,  and  he  terms  these  peaks 
resonances  of  the  first  and  the  second  kind,  respectively.  We  see  that  the  reflection  peaks 
in  general  will  not  have  a  Lorentzian  shape,  and  that  Price's  resonances  of  the  second 
kind  2ure  recovered  when  Ep  =  £©• 

In  summary,  we  have  investigated  the  detailed  line  shape  of  the  trainsmission  probability 
in  quantum  waveguides  with  resonantly-coupled  cavities.  The  resonance/anti  resonance 
features  in  the  vicinity  of  each  quasi-bound  states  can  be  characterized  by  a  zero-pole  pair 
in  the  complex-energy  plane.  We  have  found  a  generalization  of  the  familiar  symmetrical 
Lorentzian  resonance  peaks.  Using  several  examples,  we  have  demonstrated  the  utility  of 
our  line  shape  (4)  to  extract  the  lifetime  of  the  quasi- bound  state  by  a  fit  to  the  data. 
We  also  discussed  the  asymmetrical  line  shapes  in  the  context  of  Fano  resonances. 
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Abstract 

Transport  responses  of  hot  electrons  in  quantum  wires  at  high  temperature  are  calculated 
including  both  optical  and  acoustic  phonon  scatterings  to  study  how  the  average  electron 
energy  and  the  drift  velocity  depend  on  the  applied  electric  field.  The  average  electron 
energy  is  found  to  increase  monotonically  with  the  applied  electric  field  showing  no  anoma¬ 
lous  carrier  cooling.  The  calculated  results  of  drift  velocity  at  lower  electric  fields  agree 
well  with  the  results  obtained  by  solving  linearized  Boltzmann  transport  equation.  The 
contribution  of  acoustic  phonon  scattering  to  the  trcuisport  response  is  also  discussed. 


I  Introduction 

In  recent  years,  mesoscopic  structures,  such  as  quantum  wires  and  dots,  have  been  de¬ 
signed  to  investigate  new  phenomena  and  to  explore  high  speed  devices.  In  quantum 
wires,  momentum  space  is  limited  to  a  single  dimension,  and  the  carrier  density  of  states 
(DOS)  shows  some  singularities  resulting  in  changes  in  the  nature  of  particle  collisions. 
It  is  pointed  out  that  at  a  moderate  electric  field  anomalous  carrier  cooling  of  electrons 
in  quantum  wires  enhances  the  transport  response  due  to  the  conversion  of  thermal  en¬ 
ergy  into  drift  motion  [1].  In  the  present  work,  we  are  interested  in  obtaining  transport 
responses  of  hot  electrons  in  quantum  wires  at  high  temperature  including  both  optical 
and  acoustic  phonon  scatterings  to  study  how  the  average  electron  energy  and  the  drift 
velocity  depend  on  the  applied  electric  field.  We  adopt  the  Rees’s  iterative  method  [2] 
to  evaluate  hot  electron  distribution  function,  since  the  method  provides  a  consistent 
treatment  of  the  divergence  of  DOS  of  electrons  and  the  scattering  probability  [3]. 

II  Model 

We  employ  a  simple  model  for  a  quantum  wire  in  which  a  two-dimensional  electron  gas 
in  zy-plane  is  confined  by  narrow  gates  or  split  gates,  and  electrons  are  free  along  only 
the  X  direction.  We  assume  that  the  wave  function  associated  with  quantized  z  motion 
is  expressed  by  the  Fzmg-Howard  variational  function  =  (^'^/2)‘^^xexp(— 562))  and 
the  confinement  in  the  y  direction  is  characterized  by  a  parabolic  potential  of  frequency 
^  (^(y)  =  We  consider  only  the  ground  subband,  i.e.  the  strong  confinement 

case.  Furthermore,  number  of  electrons  is  assumed  to  be  so  small  that  their  distribution 
is  given  by  the  Boltzmann  distribution  function  in  thermal  equilibrium. 


* 


Ill  Method 


We  evaluate  the  electron  distribution  function  by  using  the  iterative  method  propc»ed 
by  Rees  [2]  accounting  the  Fermi’s  golden  rule  to  calculate  the  scattering  probability.  In 
this  study,  we  consider  bulk  longitudinal  optical  phonon  scattering  and  acoustic  deforma¬ 
tion  potential  scattering,  while  other  scattering  processes  are  ignored  for  simplicity.  The 
distribution  function  f{k)  is  calculated  by  the  iterative  process  as  follows: 


f„^,{k)  =  r  rg„ik~eEt)e-^Ut, 

Jo 

(1) 

k  '■ 

(2) 

where  S(fc,  fc')  is  the  scattering  probability  per  unit  time  from  k  to  k'  and  F  is  the  total 
scattering  rate  including  self-scattering  process  which  is  positive  constant  and  determines 
the  convergence  of  the  iteration.  5(A:,  A:'),  which  includes  self-scattering  process,  is  evalu¬ 
ated  from  real  scattering  probability  W{k,k')  as  foUows: 

S{k,k')  =  W{k,k')-{-S{k)Sk,k'  (3) 

with  S{k)  being  the  self-scattering  rate.  W{k,k')  is  given  by  the  sum  of  optical  and 
acoustic  phonon  scattering  probabilities: 

W{k,  k')  =  Y.  (?)  (M)  +  I  +  5»?)  G{k'  -  k)  6{ek-  -f  r?a;o  -  £*) 

T)=±l  ''*0^ 

-f-  Stt/Jwo  (^)  /v  fz  S{€k'  -  £k)  (4) 


Figure  1:  Total  scattering  rates  in  a  GaAs  quantum  wire  (solid  line). 
1  /6  =  40  A,  /?  =  50  meV  and  T  =  300  K.  Dashed  line  :  rates  for  optical 
phonon  scattering.  Dotted  line  :  rates  for  acoustic  phonon  scattering. 
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Figure  2:  Distribution  functions  f{k)  as  a  function  of  electron  energy 
6k  =  k^l2m  with  applied  electric  fields  (0,  200  and  500  V/cm)  switched 
on  for  loops.  1/6  =  40 A,  1?  =  50meV  and  T  =  300 K.  Solid  lines 
represent  f{k)  including  both  optical  and  acoustic  phonon  scatterings, 
and  dotted  lines  are  f{k)  without  acoustic  phonon  scattering. 


with 

nQ),  F(Q)  =  (5) 

where  a  is  the  Frdhlich  coupling  constant,  wo  is  the  optical  phonon  energy,  No  is  the  occu¬ 
pation  number  of  optical  phonons,  (3  =  D^mko/2ps^,  D  is  the  deformation  potential,  p  is 
the  density  of  the  material,  s  is  the  sound  velocity,  fy  =  {K|ko)|{2^/v),  =  (3/16)(6/fco), 

K  =  {2mf2)^/^,  ko  =  {2rruiJoY^^  and  Q  =  (?*  4-  Note  that  non-elastic  effects  of 

acoustic  phonon  scattering  are  ignored  in  the  present  calculation. 

Calculated  scattering  rates  of  a  GaAs  quantum  wire  2U’e  shown  in  Fig.  1.  Scattering 
rates  by  acoustic  phonons  are  larger  than  those  of  optical  phonons  for  low  energy  electrons 
because  of  the  singularity  of  DOS  at  the  bottom  of  subband. 


IV  Results 

Figure  2  shows  calculated  results  of  distribution  functions  with  applied  electric  fields  (0, 
200  and  500  V/cm)  switched  on  for  100  ps.  In  the  case  of  zero  applied  electric  field,  the 
distribution  function  coincides  with  the  Boltzmann  distribution  function.  This  clearly 
indicates  that  the  iterative  method  is  valid  for  evaluating  transport  responses  of  one- 
dimensional  electron  gases. 

In  Fig.  3  we  show  the  calculated  average  electron  energy  £,»,  in  a  quantum  wire  of 
GaiAs  as  a  function  of  applied  electric  field  E.  We  find  that  e^ve  increases  monotonically 
with  E  and  an  anomalous  carrier  cooling  does  not  occur.  The  calculated  result  of  drift 
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Figure  3:  Average  electron  energy  e»v*  in  a  GaAs  quantum  wire  as 
a  function  of  applied  electric  fields.  Thermal  kinetic  energy  {^kT)  is 
represented  by  dotted  line.  1/5  =  40A,  f}  =  50meV  and  T  =  300  K. 
Solid  circles  indicate  including  both  optical  and  acoustic  phonon 
scatterings,  and  open  circles  are  £ave  without  acoustic  phonon  scattering. 
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Figure  4:  Electron  drift  velocity  in  a  GaAs  quantum  wire  as  a  function  of 
applied  electric  fields  (solid  circles),  compaired  with  the  results  obtained 
by  solving  linearized  Boltzmann  transport  equation  (dotted  line).  Both 
optical  and  acoustic  phonon  scatterings  are  considered.  1/6  =  40  A, 
f]  =  50meV  and  T  =  300  K. 
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velocity  is  also  shown  in  Fig.  4.  The  drift  velocity  at  lower  electric  fields  agrees  well 
with  the  results  obt2dned  by  solving  linearized  Boltzmann  transport  equation  At  higher 
electric  fields,  the  drift  velocity  saturates  as  the  electron  temperature  increases.  Although 
electron  energy  and  drift  velocity  are  influenced  only  a  little  by  acoustic  phonon  scattering 
in  the  case  of  1/6  =  40  A  and  17  =  50  meV,  it  is  not  too  small  to  be  neglected. 

Figure  5  shows  a  contribution  of  acoustic  phonon  scattering  to  the  drift  velocity.  The 
contribution,  is  defined  its  follows: 

=  1  _  (6) 

Wop 

where  Wop+»c  is  the  drift  velocity  calculated  by  including  both  optical  and  acoustic  phonon 


electric  field  (V/cm) 

Figure  5:  Contribution  of  acoustic  phonon  scattering  to  the  drift  veloc¬ 
ity  for  17  =  50  and  70meV.  The  contribution,  is  defined  in  the  text. 

1/6  =  40  A  and  r  =  300  K. 

scatterings,  and  Vop  is  that  without  acoustic  phonon  scattering.  The  contribution  of 
acoustic  phonon  scattering  becomes  larger  in  the  case  of  low  applied  electric  field  E  or 
strong  confinement  17. 
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